within a short reaction time and also to provide a 
This invention relates to a thermostable DNA 
polymerase having a DNA extension rate of at least 
30 bases/second and a 3'-5' exonuclease activity, derived 
from a hyperthermophilic archaeon strain KOD1, to a 
method of amplifying and also of detecting nucleic 
acid using said enzyme and to a reagent kit used for 
those methods. 
Description 

The present invention relates to a method of amplifying nucleic acid wherein DNA or RNA is amplified within a short 
reaction time and with a high fidelity, to a method of identifying nucleic acid utilizing said amplifying method and to a 
DNA polymerase and a reagent kit used for those methods. 

Many studies have already been made of DNA polymerases of mesophilic microorganisms such as Escherichia coli 
and of DNA polymerases derived from phages which infect the mesophilic microorganisms. In addition, many studies 
have also been made of heat-stable DNA polymerases which are useful in recombinant DNA techniques by 
means of nucleic acid amplification such as a polymerase chain reaction (PCR). Examples of the heat-stable 
polymerases which are used for the PCR are DNA polymerase (Tth polymerase) mostly derived from Thermus thermophilus 
and DNA polymerase (Taq polymerase) derived from Thermus aquaticus. Other known examples are DNA polymerase 
(Pfu polymerase) derived from Pyrococcus furiosus and DNA polymerase (Vent polymerase) derived from 
Thermococcus litoralis. 

However, with the Taq polymerase, fidelity and thermostability upon the synthesis of DNA are not sufficient. 
Although the Pfu polymerase exhibiting excellent fidelity and thermostability has been developed, said Pfu polymerase 
has the problems that its DNA extension rate is slow and its processivity is low, whereby it has been used only for 
specific PCRs. 

Recently, a PCR whereby DNA of 20 kb or more is amplified (hereinafter referred to as a long-PCR) has been 
developed. In said long-PCR, Taq polymerase and Pfu polymerase are mixed so that the properties of both enzymes are 
utilized. However, when two enzymes having different properties are used in the same reaction system, some 
discrepancies might occur in their appropriate reaction conditions, whereby there is a question whether the high extension rate 
and fidelity which are the advantages of each of those enzymes can still be maintained. Moreover, because of the 
difference in the thermostabilities and in the composition of the stock solutions of both enzymes, there is a question as to 
the stability when they are stored in the same container. 

In view of the above, there has been a keen demand for a novel thermostable polymerase which exhibits both of 
those advantages. 

The present inventors have succeeded in preparing a thermostable DNA polymerase from a hyperthermophilic 
archaeon strain KOD1 and, when its properties were investigated, it was found that said DNA polymerase exhibits 
the advantages of the above-mentioned two enzymes, i.e. high extension rate and high fidelity, whereby the present 
invention has been achieved. 

Thus, the present invention relates to a method for amplifying a target nucleic acid which comprises reacting the target 
nucleic acid with four kinds of dNTP and a primer complementary to said target nucleic acid in a buffer solution which 
contains a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' exonuclease 
activity such that the above-mentioned primer is annealed to the target nucleic acid and an extension product is 
synthesized from the primer. 

The present invention further relates to a method for amplifying a target nucleic acid in a sample wherein each 
target nucleic acid consists of two separate complementary strands which comprises the following steps A to D, 
characterized in that a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' 
exonuclease activity is used as a thermostable DNA polymerase; 

A: modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids; 
B: reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are 
selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, in a 
buffer solution which contains a thermostable DNA polymerase such that the above-mentioned primers are 
annealed to the single-stranded nucleic acids and extension products are synthesized from the primers; 
C: separating the primer extension products from the templates on which they are synthesized to produce 
single-stranded nucleic acids; and 
D: repeatedly conducting the above-mentioned steps B and C. 

The present invention further relates to a method for detecting a target nucleic acid in a sample wherein each target 
nucleic acid consists of two separate complementary strands which comprises the following steps A to E, characterized 
in that a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' 
exonuclease activity is used as a thermostable DNA polymerase; 

A: modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids; 
B: reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are 
selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, in a 
buffer solution which contains a thermostable DNA polymerase such that the above-mentioned primers are 
annealed to the single-stranded nucleic acids and extension products are synthesized from the primers; 
C: separating the primer extension products from the templates on which they are synthesized to produce 
single-stranded nucleic acids; 
D: repeatedly conducting the above-mentioned steps B and C; and 
E: detecting an amplified nucleic acid. 

The present invention further relates to a reagent kit for amplifying target nucleic acid which comprises primers, 
wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to 
anneal therewith, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at 
least 30 bases/second and a 3'-5' exonuclease activity and buffer solution. 

The present invention further relates to a reagent kit for detecting target nucleic acid which comprises primers, 
wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to 
anneal therewith, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at 
least 30 bases/second and a 3'-5' exonuclease activity, amplifying buffer solution, a probe capable of hybridizing with 
amplified nucleic acid and a detection buffer solution. 

The present invention relates to a thermostable DNA polymerase which is obtainable from a strain KOD1 which 
belongs to a hyperthermophilic archaeon strain. 

The present invention relates to an isolated DNA comprising a nucleotide sequence that encodes the thermostable 
DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon. 

Preferably, said DNA comprises the nucleotide sequence of SEQ ID No. 5 or a nucleotide sequence encoding the 
thermostable polymerase which has the amino acid sequence of SEQ ID No. 1. 

The present invention also relates to a DNA encoding a polypeptide with the above mentioned biological activities 
comprising a nucleotide sequence 

(a) which differs from the above DNA in codon sequence due to the degeneracy of the genetic code; 
(b) which hybridizes with the above DNA or the DNA of section (a); or 
(c) represents a fragment, allelic or other variation of the above DNA. 

In this context, the term "hybridization" refers to conventional hybridization conditions (see for example Nucleic acid 
hybridization, A practical approach, Hames and Higgins, eds., IRL Press, Oxford, Washington DC, 1985). Most 
preferably, the term "hybridization" refers to stringent hybridization conditions. 

The present invention also relates to a thermostable DNA polymerase having the above-mentioned biological 
activities and being encoded by one of said DNAs and, in addition, to a kit comprising said thermostable DNA polymerase. 

The present invention further relates to a recombinant DNA expression vector that comprises the DNA sequence 
inserted into a vector, wherein the DNA sequence encodes the thermostable DNA polymerase derived from a KOD1 
strain which belongs to hyperthermophilic archaeon. 

The present invention further relates to a recombinant host cell transformed using a recombinant DNA expression 
vector that comprises the DNA sequence inserted into a vector, wherein the DNA sequence encodes the thermostable 
DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon. 

The present invention relates to a method for producing a DNA polymerase obtainable from a KOD1 strain which 
belongs to hyperthermophilic archaeon, which comprises culturing recombinant host cells which are transformed by a 
recombinant DNA expression vector that comprises the DNA sequence inserted into a vector, wherein the DNA sequence 
encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon, 
and recovering the produced thermostable DNA polymerase. 

The present invention further relates to a method for purifying the DNA polymerase obtainable from a KOD1 strain 
which belongs to hyperthermophilic archaeon, which comprises culturing the recombinant host cells which are transformed by 
a recombinant DNA expression vector that comprises the DNA sequence inserted into a vector, wherein the DNA 
sequence encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic 
archaeon, and further (a) recovering the cultured recombinant host cells, disintegrating them and preparing the cell 
extract, and (b) removing the impurity proteins derived from the recombinant host cells. 

The nucleic acid which is to be amplified by the present invention is DNA or RNA. There is no restriction at all on 
the sample in which such a nucleic acid is contained. 

The thermostable enzyme which is used in the present invention is a thermostable DNA polymerase having a DNA 
extension rate of at least 30 bases/second and having a 3'-5' exonuclease activity. Its specific example is a DNA 
polymerase derived from a hyperthermophilic archaeon strain KOD1 (called a KOD polymerase) and said enzyme may be 
either a thermostable enzyme purified from nature or an enzyme manufactured by a gene recombination technique. 

The DNA extension rate in the present invention is calculated from the relationship between the reaction time and 
the size of the synthesized DNA in the reaction of various kinds of DNA polymerases such as KOD, Pfu, Deep Vent, 
Taq, etc. (5 U) in each buffer using a substrate prepared by annealing a single-stranded DNA (1.6 µg) of M13 with a 
primer (16 pmoles) complementary thereto. It is essential in the present invention that the DNA extension rate is at least 
30 bases/second. 

The DNA extension rates for each of the polymerases are 105-130 bases/second for KOD polymerase, 24.8 
bases/second for Pfu polymerase, 23.3 bases/second for Deep Vent polymerase and 61.0 bases/second for Taq 
polymerase. 
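
As a rough illustration of the calculation described above, the extension rate can be taken as the size of the synthesized DNA divided by the reaction time and then compared with the threshold of 30 bases/second. The following Python sketch uses the rates quoted in the text; the function names and the example size/time pair are hypothetical and are not taken from the measurements.

```python
# Minimal sketch: extension rate = synthesized length / reaction time.
# The rates in QUOTED_RATES are the values quoted in the text; the function
# names and the example reading below are illustrative assumptions only.

THRESHOLD = 30.0  # bases/second, the minimum rate required in the text


def extension_rate(product_bases: float, seconds: float) -> float:
    """Average extension rate in bases/second."""
    if seconds <= 0:
        raise ValueError("reaction time must be positive")
    return product_bases / seconds


def meets_threshold(rate: float, threshold: float = THRESHOLD) -> bool:
    """True when the rate is at least the required threshold."""
    return rate >= threshold


# KOD is quoted as a range (105-130); the lower bound is used here.
QUOTED_RATES = {"KOD": 105.0, "Pfu": 24.8, "Deep Vent": 23.3, "Taq": 61.0}

if __name__ == "__main__":
    for name, rate in QUOTED_RATES.items():
        print(f"{name}: {rate} bases/s, meets threshold: {meets_threshold(rate)}")
    # Hypothetical reading: 3000 bases synthesized in 30 seconds.
    print(extension_rate(3000, 30))
```

Of the quoted values, only those for KOD polymerase and Taq polymerase reach the threshold; the rate alone does not settle the matter, since the 3'-5' exonuclease activity is a separate requirement.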

On the other hand, it is essential in the present invention that the thermostable DNA polymerase has a 3'-5' 
exonuclease activity. 

In the present invention, the 3'-5' exonuclease activity is determined by checking the rate of release of ³H under the 
optimum condition for each polymerase using a substrate wherein the 3'-end of the lambda-DNA digested with HindIII 
is labeled with [³H]TTP. 

As to the 3'-5' exonuclease activity of each polymerase, free ³H is found to be only 10-20% in the case of Taq 
polymerase and Tth polymerase after an incubation period of three hours, whereas in KOD polymerase and Pfu 
polymerase it is 50-70%. 

It has been confirmed that the KOD polymerase used in the present invention has a 3'-5' exonuclease activity and 
that, in the gene which codes for KOD polymerase, there is a conserved DNA sequence associated with a 3'-5' exonuclease 
activity, the same as in the case of Pfu polymerase. 

In the present invention, the presence of a 3'-5' exonuclease activity is checked in such a manner that KOD 
polymerase is allowed to stand, using as a substrate a DNA fragment in which [³H]TTP is incorporated into lambda-DNA digested 
with HindIII, at the reaction temperature of 75°C in a buffer (20 mM Tris-HCl of pH 6.5, 
10 mM KCl, 6 mM (NH4)2SO4, 2 mM MgCl2, 0.1% Triton X-100 and 10 µg/ml BSA) and the ratio of the free [³H]TTP is 
determined. 

At the same time, Taq polymerase and Tth polymerase having no 3'-5' exonuclease activity and Pfu polymerase 
having a 3'-5' exonuclease activity were checked in the same manner, using a buffer for each of them, as control 
experiments. The titer of each of the polymerases used was 2.5 units. 

The substrate DNA was prepared in such a manner that, first, 0.2 mM of dATP, dGTP, dCTP and [³H]TTP were 
added to 10 ng of lambda-DNA digested with HindIII, the 3'-end was elongated by Klenow polymerase, then DNA 
fragments were recovered by extracting with phenol and precipitating with ethanol and free mononucleotides were removed 
by a Spin column (manufactured by Clontech). 

In the case of KOD polymerase and Pfu polymerase, 50-70% of free [³H]TTP was detected after an incubation 
period of three hours, whereas in the case of Taq polymerase and Tth polymerase, only 10-20% of free [³H]TTP was noted. 

It is preferred that said thermostable DNA polymerase contains an amino acid sequence given in SEQ ID No.1. 

It is also preferred that said thermostable DNA polymerase is an enzyme having the following physical and 
chemical properties. 

Action: It has a DNA synthetic activity and a 3'-5' exonuclease activity. 
DNA extension rate: at least 30 bases/second 
Optimum pH: 6.5-7.5 (at 75°C) 
Optimum temperature: 75°C 
Molecular weight: about 88-90 kDa 
Amino acid sequence: as mentioned in SEQ ID No.1 

An example of the methods for manufacturing DNA polymerase derived from a hyperthermophilic archaeon strain 
KOD1 is that a thermostable DNA polymerase gene is cloned from strain KOD1, which was isolated from a solfatara at 
a wharf on Kodakara Island, Kagoshima, a recombinant expression vector is constructed, then a transformant 
prepared by transformation with said recombinant vector is cultured and the thermostable DNA polymerase is 
collected from the culture followed by purifying. 

In the present invention, the DNA polymerase derived from the above-mentioned hyperthermophilic archaeon 
strain KOD1 has a DNA synthesizing activity and a 3'-5' exonuclease activity and has a DNA extension rate of at least 
30 bases/second. This property is used for conducting an amplification of nucleic acid. 
The amplifying method of the present invention includes the following steps A to D. 

A: modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids; 
B: reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are 
selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, in a 
buffer solution which contains a thermostable DNA polymerase such that the above-mentioned primers are 
annealed to the single-stranded nucleic acids and extension products are synthesized from the primers; 
C: separating the primer extension products from the templates on which they are synthesized to produce 
single-stranded nucleic acids; and 
D: repeatedly conducting the above-mentioned steps B and C. 

In the step A, the target nucleic acid is denatured if necessary to give a single-stranded nucleic acid. The means 
therefor may be a thermal treatment, a chemical denaturation or an enzymatic treatment. Preferably, it is a thermal 
treatment. 

In the step B, said single-stranded nucleic acid is made to react with four kinds of dNTP (dATP, dGTP, dCTP and 
dTTP or dUTP) and primers with regular and inverted directions having complementary base sequences to the target 
nucleic acid in a buffer solution containing a thermostable DNA polymerase so that said primers are annealed to the 
single-stranded nucleic acid to conduct a primer extension reaction. 

A primer with a regular direction and that with an inverted direction having complementary base sequences to the 
target nucleic acid are oligonucleotides each having a base sequence which is complementary to one strand of the target 
nucleic acid and is homologous to the other. Accordingly, one primer may be complementary to the extension product of 
the other primer. 
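
The relationship between the two primers can be illustrated on a toy sequence. In the Python sketch below, the template and the primer length are invented for illustration; the forward primer is homologous to the 5' end of one strand, while the reverse primer is the reverse complement of its 3' end and therefore anneals to that same strand.

```python
# Toy illustration of primers with regular (forward) and inverted (reverse)
# directions. The template sequence and primer length are hypothetical.
from typing import Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(top_strand: str, length: int) -> Tuple[str, str]:
    """Return (forward, reverse) primers of the given length.

    forward: first `length` bases of the top strand (homologous to the top
             strand, complementary to the bottom strand).
    reverse: reverse complement of the last `length` bases of the top strand
             (complementary to the top strand).
    """
    if length <= 0 or length > len(top_strand):
        raise ValueError("primer length out of range")
    forward = top_strand[:length].upper()
    reverse = reverse_complement(top_strand[-length:])
    return forward, reverse


if __name__ == "__main__":
    template = "ATGGCCAAGTTGACCGTTAGC"  # hypothetical top strand, 5'->3'
    fwd, rev = design_primers(template, 6)
    print("forward:", fwd)
    print("reverse:", rev)
```

Real primer design also weighs melting temperature, specificity and secondary structure, none of which this sketch attempts.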

Preferred buffer solutions containing a thermostable DNA polymerase are Tris buffers containing divalent cation 
such as magnesium ion. 

An example of the conditions for conducting an elongation reaction by annealing the primer is a method in which a 
cycle of 98°C for 1 second to 1 minute and 68°C for 1 second to 10 minutes is repeated 30 times. 

The step of separating an elongated primer for making a single strand in the step C may be a thermal treatment, a 
chemical treatment or an enzymatic treatment. Preferably, it is a thermal treatment or an enzymatic treatment using 
RNase. 

In the step D, the above-mentioned steps B and C are repeated. To be more specific, it is preferred that heating and 
cooling of 98°C/20 seconds and 68°C/30 seconds are repeated for at least 30 cycles. 
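
For orientation, the total hold time of such a program is simply the per-cycle hold time multiplied by the number of cycles. The Python sketch below computes it for the preferred conditions; it assumes that ramping between temperatures takes no time, which a real thermal cycler does not satisfy, and the function name is hypothetical.

```python
# Sketch: total hold time of a two-step cycling program.
# Assumption: ramp times between temperatures are ignored.
from typing import List, Tuple


def program_seconds(steps: List[Tuple[float, float]], cycles: int) -> float:
    """steps is a list of (temperature in deg C, hold time in seconds)."""
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return cycles * sum(hold for _temperature, hold in steps)


if __name__ == "__main__":
    preferred = [(98.0, 20.0), (68.0, 30.0)]  # conditions given in the text
    total = program_seconds(preferred, 30)
    print(f"{total:.0f} s = {total / 60:.0f} min")
```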

An amplifying method of the present invention is applicable to a PCR for amplifying a DNA of 20 kb or more 
(hereinafter referred to as a long-PCR) as well. In this long-PCR, both the high DNA extension rate of Taq 
polymerase and the high fidelity in DNA synthesis caused by the 3'-5' exonuclease activity of Pfu polymerase are necessary, 
and both enzymes are used after mixing them. In this case, there is a question as to the stability when both enzymes are 
stored in the same container because of the difference between their thermostabilities and that between the 
compositions of their storage solutions. However, in the DNA polymerase derived from a hyperthermophilic archaeon strain 
KOD1, a single enzyme exhibits both a high DNA extension rate and high fidelity due to its 3'-5' exonuclease activity, 
whereby a long-PCR can be conducted by its sole use. 
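
To give a feel for why the extension rate matters in a long-PCR, the Python sketch below estimates the idealized time needed to extend across a 20 kb target at each of the extension rates quoted earlier in the text. It assumes a constant rate and uninterrupted synthesis, so the figures are lower bounds for illustration and not recommended extension times; the function name is hypothetical.

```python
# Idealized extension time for one pass over a target:
# time = target length / extension rate.
# Assumption: constant rate, no dissociation or pausing.

QUOTED_RATES = {  # bases/second, values quoted in the text
    "KOD (lower bound)": 105.0,
    "Pfu": 24.8,
    "Deep Vent": 23.3,
    "Taq": 61.0,
}


def extension_seconds(target_bases: float, rate: float) -> float:
    """Seconds needed to synthesize target_bases at rate bases/second."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    return target_bases / rate


if __name__ == "__main__":
    target = 20_000  # a 20 kb long-PCR target
    for name, rate in QUOTED_RATES.items():
        print(f"{name}: {extension_seconds(target, rate):.0f} s per extension")
```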

In the present invention, a target nucleic acid can be detected by applying, for example, a labeled probe to the 
amplified product produced by the above-mentioned amplification. 

A labeled probe is an oligonucleotide having a base sequence which is complementary to a target nucleic acid and 
is bonded with a labeled substance or a labeled binding substance. 

Examples of the labeled substance are enzymes such as alkaline phosphatase, peroxidase and galactosidase, 
fluorescent substances and radioactive substances while examples of the labeled binding substances are biotin and 
digoxigenin. The labeled substance may be bonded via biotin, digoxigenin or avidin. 

A method of introducing those labels into a probe is that, during the synthesis of oligonucleotide, dNTP to which 
those labeled substances or labeled binding substances are bonded is used as one of the components of dNTP 
whereby a synthesis is conducted. 

Examples of methods for detecting a nucleic acid bonded with a labeled probe are conventionally known methods such as 
Southern hybridization and Northern hybridization. Those methods utilize the fact that a hybrid is formed when 
single-stranded DNA and RNA are complementary to each other: an unknown group of nucleic acid fractions is 
subjected to agarose electrophoresis to separate it by size, then the nucleic acid fraction in the gel is subjected, for 
example, to an alkali treatment, and the resulting single strand is transferred to a filter, immobilized and hybridized with a labeled 
probe. 

As to detection of the label in case an alkaline phosphatase is used as a labeled substance, when a 
chemiluminescent substrate such as a 1,2-dioxetane compound (PPD) is made to react therewith, only the nucleic acid forming a 
hybrid emits light. An X-ray film is exposed thereto whereby the size of the target nucleic acid and its position on 
electrophoresis can be confirmed. 

A reagent kit for nucleic acid amplification according to the present invention contains primers of regular and 
inverted directions having base sequences complementary to target nucleic acid, four kinds of dNTP, divalent cation, 
thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and having a 3'-5' 
exonuclease activity and a buffer solution. 

An example of divalent cation is magnesium ion. Its concentration is preferably about 1-3 mM. Examples of the 
buffer solution are Tris buffer (pH 6.5, 75°C) and tricine buffer (pH 6.5, 75°C). 
A specific example of the composition is as follows: 
20 mM Tris-HCl (pH 6.5, 75°C) 
10 mM KCl 
6 mM (NH4)2SO4 
1-3 mM MgCl2 
0.1% Triton X-100 
10 µg/ml BSA 
20-200 µM dNTPs 






EP 0 745 675 A2 



0.1 µM-1 µM primer 
0.1-250 ng template DNA. 
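
When such a composition is assembled from concentrated stocks, the volume of each stock follows from the dilution relation C1 x V1 = C2 x V2. The Python sketch below works through this for the buffer salts only. The stock concentrations, the 50 µl reaction volume and the choice of 2 mM MgCl2 (within the stated 1-3 mM range) are assumptions made for illustration; the remaining components (Triton X-100, BSA, dNTPs, primers, template and enzyme) are left out and would be taken from the remaining volume.

```python
# Sketch: stock volumes for the buffer salts via C1 * V1 = C2 * V2.
# FINAL_MM follows the composition in the text (MgCl2 fixed at 2 mM here).
# STOCK_MM and the 50 ul reaction volume are hypothetical.
from typing import Dict

FINAL_MM = {"Tris-HCl": 20.0, "KCl": 10.0, "(NH4)2SO4": 6.0, "MgCl2": 2.0}
STOCK_MM = {"Tris-HCl": 1000.0, "KCl": 1000.0, "(NH4)2SO4": 1000.0, "MgCl2": 25.0}


def stock_volume(final_conc: float, stock_conc: float, total_volume: float) -> float:
    """Volume of stock needed so that the final mix has final_conc."""
    if stock_conc <= 0:
        raise ValueError("stock concentration must be positive")
    if final_conc > stock_conc:
        raise ValueError("stock is more dilute than the requested final concentration")
    return final_conc * total_volume / stock_conc


def mix_volumes(final: Dict[str, float], stock: Dict[str, float],
                total_volume_ul: float) -> Dict[str, float]:
    """Per-component stock volumes plus the volume left for everything else."""
    volumes = {name: stock_volume(conc, stock[name], total_volume_ul)
               for name, conc in final.items()}
    remaining = total_volume_ul - sum(volumes.values())
    if remaining < 0:
        raise ValueError("stocks are too dilute for this reaction volume")
    volumes["remaining"] = remaining
    return volumes


if __name__ == "__main__":
    for name, volume in mix_volumes(FINAL_MM, STOCK_MM, 50.0).items():
        print(f"{name}: {volume:.2f} ul")
```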

A reagent kit for nucleic acid detection according to the present invention contains a nucleic acid amplifying 
reagent comprising primers of regular and inverted directions having base sequences complementary to target nucleic 
acid, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at least 30 
bases/second and having a 3'-5' exonuclease activity and a buffer solution for amplification, a target nucleic acid probe 
and a buffer for detection. The detecting reagent in the buffer for detection varies depending upon the label. For 
example, it includes a color reagent or a luminous reagent. 

KOD1, which is a kind of hyperthermophilic archaeon used in the present invention, is a strain isolated from a 
solfatara at a wharf on Kodakara Island, Kagoshima. 

Mycological properties of said strain are as follows. 
Shape of cells: coccus, diplococcus; having flagella. 
Temperature range for the growth: 65-100°C 
Optimum temperature for the growth: 95°C 
pH range for the growth: 5-9 
Optimum pH: 6 

Optimum salt concentration: 2-3% 
Auxotrophy: heterotrophic 
Oxygen demand: aerophobic 
Cell membrane lipids: ether type 
GC content of DNA: 38% 

The hyperthermophilic archaeon strain KOD1 was a coccus having a diameter of about 1 µm and had plural polar 
flagella. From the mycological properties of the strain, its close relationship with the Pfu DNA polymerase-productive 
bacterium (Pyrococcus furiosus) and with the Tli (Vent) DNA polymerase-productive bacterium (Thermococcus litoralis) was 
suggested. 

Cloning of the thermostable DNA polymerase gene of the present invention is carried out as follows. 

The cloning method is that a primer is designed and synthesized depending upon an amino acid sequence in a 
conserved region of Pfu DNA polymerase (Nucleic Acids Research, 1993, vol. 21, No. 2, 259-265). 

First, a PCR is conducted using the above-prepared primers (e.g., SEQ ID Nos.7 and 8) taking chromosomal DNA 
of the hyperthermophilic archaeon strain KOD1 as a template to amplify the DNA fragment. The DNA sequence (e.g., 
SEQ ID No.9) of the amplified fragment is determined and, after confirming that the originally set amino acid sequence 
is coded for, a Southern hybridization is conducted, using said fragment as a probe, on the product obtained by cleaving 
the chromosomal DNA with a restriction enzyme. It is preferred that the approximate size of the fragment containing the aimed 
DNA polymerase gene is limited to about 4-7 Kbp. 

Then the DNA fragment of about 4-7 Kbp is recovered from the gel, a DNA library is prepared in Escherichia coli using 
said fragment and a colony hybridization is carried out using the above-mentioned PCR-amplified DNA fragment (e.g., 
SEQ ID No.9) to collect a clone strain. 

The DNA polymerase gene of the strain KOD1 cloned in the present invention is composed of 5010 bases 
(estimated number of amino acids: 1670) (SEQ ID No.5). 

Upon comparison with other DNA polymerases, there are conserved regions (Regions 1-5) of α DNA polymerase, which is a 
eukaryote type, in the gene of the present invention. In addition, there are EXO 1, 2 and 3, which are 3'->5' 
exonuclease motifs, at the N terminal of said gene. In the conserved regions (Regions 1, 2) of the thermostable DNA 
polymerase gene derived from the hyperthermophilic archaeon strain KOD1, each of the intervening sequences is 
present and they are connected in a form where the open reading frame (ORF) is conserved. 

When the thermostable DNA polymerase gene of the hyperthermophilic archaeon strain KOD1 is compared with 
the Pfu DNA polymerase gene derived from Pyrococcus furiosus (Japanese Laid-Open Patent Publication Hei-05/328969) 
and with the Tli (Vent) DNA polymerase gene derived from Thermococcus litoralis (Japanese Laid-Open Patent Publication 
Hei-06/7160), which are known enzymes, intervening sequences are present in the gene of the strain KOD1 of the present 
invention while there is no intervening sequence in the gene of the above-mentioned Pfu DNA polymerase. In the 
Tli DNA polymerase gene, there are two kinds of intervening sequences but they are present within Regions 2 and 3, 
which are conserved regions, and that greatly differs from the location where the intervening sequences in the 
thermostable DNA polymerase gene of the KOD1 strain of the present invention exist (see Fig. 7). 

The gene of the present invention is a DNA which codes for the DNA polymerase derived from the 
hyperthermophilic archaeon strain KOD1. An example of said DNA contains a base sequence which codes for the amino acid 
sequence mentioned in SEQ ID No. 1 or 5. Further, such a DNA contains a base sequence mentioned in SEQ ID No. 
5 or 6 or a part thereof. 

In order to express the thermostable DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 
of the present invention in Escherichia coli, the intervening sequences of 1374-2453 bp and 2708-4316 bp in the base 
sequence shown by SEQ ID No.5 are removed by means of a PCR gene fusion to construct a DNA polymerase gene 
of a complete form. To be specific, a PCR is conducted on a cloned gene containing the intervening sequences by a 
combination of three pairs of primers to amplify the three fragments which are divided by the intervening sequences. In 
designing the primers used here, a part of the fragment which is to be bonded to its terminal is contained in its 5'-end. 
Then a PCR is conducted using the fragments to be bonded, utilizing the duplicated sequence of the terminal, whereby 
each of the fragments is bonded. A further PCR is conducted in the same manner using the resulting two kinds of 
fragments to give a DNA polymerase gene derived from the strain KOD1 in a complete form containing no intervening 
sequence. 
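
The outcome of removing the intervening sequences can be pictured as joining the three flanking fragments in order. The Python sketch below does this in silico on an invented 20-base sequence and also reports the lengths of the two intervening sequences from the coordinates given above. It assumes that those coordinates are 1-based and inclusive, which the text does not state, and it models only the joined result, not the PCR gene fusion procedure itself; all function names are hypothetical.

```python
# In-silico sketch: remove intervening segments and join the flanking
# fragments. Assumption: coordinates are 1-based and inclusive.
from typing import List, Tuple

# Coordinates of the intervening sequences as given in the text.
INTERVENING = [(1374, 2453), (2708, 4316)]


def segment_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive segment."""
    if start < 1 or end < start:
        raise ValueError("invalid segment")
    return end - start + 1


def remove_segments(seq: str, segments: List[Tuple[int, int]]) -> str:
    """Return seq with the given non-overlapping segments removed."""
    pieces = []
    position = 0  # 0-based index of the next base to keep
    for start, end in sorted(segments):
        if start < 1 or end > len(seq) or start > end or start - 1 < position:
            raise ValueError("segments must be in range and non-overlapping")
        pieces.append(seq[position:start - 1])
        position = end
    pieces.append(seq[position:])
    return "".join(pieces)


if __name__ == "__main__":
    toy = "AAAACCCCGGGGTTTTAAAA"  # hypothetical 20-base sequence
    print(remove_segments(toy, [(5, 8), (13, 16)]))
    for start, end in INTERVENING:
        print(f"{start}-{end}: {segment_length(start, end)} bp")
```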

Any vector may be used in the present invention so far as it makes cloning and expression of the thermostable DNA 
polymerase derived from KOD1 possible, e.g. a phage or plasmid. An example of the plasmid is a plasmid vector 
wherein an expression induced by T7 promoter is possible such as pET-8c. Other examples of the plasmid are pUC19, 
pBR322, pBluescript, pSP73, pGW7, pET3A and pET11C and so on. Examples of the phage are lambda gt11, lambda 
DASH and lambda ZapII and so on. 

Examples of the host cell used in the present invention are Escherichia coli and yeasts. Examples of Escherichia 
coli are JM109, 101, XL1, PR1 and BL21(DE3)pLysS and so on. 

In the present invention, the gene coding for the thermostable DNA polymerase derived from the above-mentioned 
KOD1 is inserted into the above-mentioned vector to give a recombinant vector and the host cell is subjected to a 
transformation using said recombinant vector. 

In the producing method of the present invention, the above-mentioned recombinant host cell is cultured whereby 
the thermostable DNA polymerase gene derived from the strain KOD1 is induced and expressed. The culture medium 
used for the culture of the recombinant host cell and the conditions therefor follow the conventional methods. 

In a specific example, Escherichia coli which is transformed by pET-8c plasmid containing a DNA polymerase gene 
in a complete form containing no intervening sequence derived from the strain KOD1 is cultured, for example, in a TB 
medium and an induction treatment is conducted. It is preferred that the induction treatment of T7 promoter is 
carried out by addition of isopropylthio-β-D-galactoside. 

The purifying method of the present invention includes, after culturing the recombinant host cells, a step wherein 
(a) recombinant host cells are collected, disintegrated and the cell extract is prepared and a step wherein (b) impure 
protein derived from the host cells is removed. 

The thermostable DNA polymerase which is produced from the recombinant host cells is separated and recovered 
by culturing the host bacterial cells in a medium, inducing them and collecting the cells from the culture liquid by means 
of centrifugation or the like. After said bacterial cells are resuspended in a buffer, they are disintegrated by means of 
ultrasonic treatment, Dyno mill, French press, etc. Then a thermal treatment is conducted and the heat-stable DNA polymerase is 
recovered from the supernatant fluid. In disintegrating the bacterial cells, ultrasonic treatment, Dyno mill and French 
press method are preferred. 

A thermal treatment is preferred as one of the steps for removing the impure protein derived from the host cells. 
The condition for the thermal treatment is at 70°C or higher or, preferably, at 90°C or higher. Other means for removing 
the impure protein are various chromatographic techniques. 

The molecular weight of the thermostable DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 
obtained as such is about 90 kDa (cf. Fig. 5). 

When a polymerase chain reaction is conducted using said thermostable DNA polymerase, a sufficient 
amplification of the aimed DNA fragments is confirmed (cf. Fig. 6). 

Now the present invention will be illustrated by referring partly to the drawings wherein: 

Fig. 1 is a photographic picture of electrophoresis as a substitute for a drawing and shows the result of the
measurement of the DNA extension rate of the KOD polymerase;

Fig. 2 is a photographic picture of electrophoresis as a substitute for a drawing and shows the comparison of the
DNA extension rate of various thermostable DNA polymerases in which Fig. 2a shows the cases of KOD polymerase
and Pfu polymerase while Fig. 2b shows the cases of Deep Vent polymerase and Taq polymerase;

Fig. 3 is a photographic picture of electrophoresis as a substitute for a drawing and shows the comparison of the
PCR due to the difference in the reaction time of various thermostable DNA polymerases;

Fig. 4 shows the constructive charts of the recombinant expression vector;

Fig. 5 is a photographic picture of electrophoresis as a substitute for a drawing and shows the result of the
measurement of molecular weight of the thermostable DNA polymerase derived from KOD1;

Fig. 6 is a photographic picture of electrophoresis as a substitute for a drawing and shows the result of the PCR by
the thermostable DNA polymerase derived from KOD1; and

Fig. 7 shows drawings which show a comparison of the DNA polymerase gene derived from the hyperthermophilic
archaeon strain KOD1 with the thermostable DNA polymerase gene derived from Pyrococcus furiosus and that
derived from Thermococcus litoralis which are thought to be similar bacteria.

Example 1.

Cloning of DNA Polymerase Gene Derived from hyperthermophilic archaeon strain KOD1 

The hyperthermophilic archaeon strain KOD1 isolated in Kodakara Island, Kagoshima was cultured at 95°C and 
then the bacterial cells were recovered. Chromosomal DNA of the hyperthermophilic archaeon strain KOD1 was pre- 
pared by a conventional method from the resulting bacterial cells. 

Two kinds of primers (5'-GGATTAGTATAGTGCCAATGGAAGGCGAC-3' [SEQ ID No.7] and
5'-GAGGGCGAAGTTTATTCCGAGCTT-3' [SEQ ID No.8]) were synthesized based upon the amino acid sequence at the conserved
region of the DNA polymerase (Pfu polymerase) derived from Pyrococcus furiosus. A PCR was carried out using those two
primers where the prepared chromosomal DNA was used as a template.
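
As a small illustration of the two primers quoted above, the following sketch reports their length, GC fraction and reverse complement. The helper names (`gc_fraction`, `reverse_complement`) are hypothetical and are not part of the specification; the sequences are the ones given for SEQ ID No.7 and No.8.

```python
# Illustrative only: basic properties of the two primers quoted in Example 1.
# Function names are assumptions made for this sketch.

PRIMERS = {
    "SEQ ID No.7": "GGATTAGTATAGTGCCAATGGAAGGCGAC",
    "SEQ ID No.8": "GAGGGCGAAGTTTATTCCGAGCTT",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "nt,", f"GC = {gc_fraction(seq):.2f},",
              "reverse complement:", reverse_complement(seq))
```

Such a check is only a sanity aid when transcribing primer sequences; it says nothing about how the primers were designed in the Example.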

After the base sequence (SEQ ID No.9) of the PCR-amplified DNA fragment was determined and the amino acid
sequence (SEQ ID No.10) was determined, a Southern hybridization was conducted using said amplified DNA fragment
to the product of the strain KOD1 chromosomal DNA treated with a restriction enzyme whereby the size of the
fragment coding for the DNA polymerase was calculated (about 4-7 Kbp). Further, the DNA fragment of this size was
recovered from agarose gel, inserted into a plasmid pBS (manufactured by Stratagene) and Escherichia coli (E. coli
JM109) was transformed by this mixture to prepare a library.

A colony hybridization was conducted using a probe (SEQ ID No.9) used for the Southern hybridization to obtain a
clone strain (E. coli JM109/pBSKOD1) which is thought to contain the DNA polymerase gene derived from strain KOD1.

Example 2. 

Determination of Base Sequence of the Cloned Fragment

A plasmid pBSKOD1 was recovered from the clone strain E. coli JM109/pBSKOD1 obtained in Example 1 and its
base sequence (SEQ ID No.5) was determined by a conventional method. Further, the amino acid sequence was 
derived from the determined base sequence. The DNA polymerase gene derived from KOD1 strain comprised 5010 
bases wherein 1670 amino acids were coded. 

Example 3. 

Construction of Recombinant Expression Vector 

In order to prepare a complete polymerase gene, the intervening sequence parts at two places (1374-2453 bp and
2708-4316 bp) were removed by a PCR fusion method. In the PCR fusion method, three pairs of primers (SEQ ID 
Nos. 11-16) were combined using the plasmid recovered from the clone strain as a template and a PCR was conducted for
each of them to amplify three fragments wherefrom the intervening sequences were removed. At that time, the primer 
used for the PCR was designed in such a manner that the side which binds to another fragment has the same sequence 
as the binding partner has. In addition, a design was conducted in such a manner that different restriction enzyme sites 
(EcoRV at the N-terminus while BamHI at the C-terminus) were created at both ends. 

After that, among the PCR-amplified fragments, that which is located at the central part of the structure and that
which is located at the N-terminal side were mixed and a PCR was conducted using each of the fragments as a primer.
At the same time, the fragment located at the central part of the structure and that located at the C-terminal side were
mixed and a PCR was conducted using each of the fragments as a primer. Two kinds of fragments obtained as such
were subjected to a PCR once again to give gene fragments in a complete form having no intervening sequence, having 
EcoRV and BamHI sites at the N- and C-termini, respectively and coding for the DNA polymerase derived from strain 
KOD1. 
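
The effect of the PCR fusion step on the sequence can be pictured with an in-silico analogue: deleting given 1-based, inclusive coordinate ranges from a string and joining what remains. This is only a sketch of the outcome, not of the laboratory procedure; the function name `remove_intervening` and the toy sequence are hypothetical, while the two coordinate ranges are the ones quoted in this Example.

```python
# Illustrative only: string-level analogue of removing intervening sequences.
# Coordinates are 1-based and inclusive, as in the text.

def remove_intervening(seq: str, ranges: list[tuple[int, int]]) -> str:
    """Return seq with every (start, end) range deleted and the rest joined."""
    kept = []
    pos = 0  # 0-based index of the first base not yet handled
    for start, end in sorted(ranges):
        if start > end or start - 1 < pos or end > len(seq):
            raise ValueError(f"invalid or overlapping range: {(start, end)}")
        kept.append(seq[pos:start - 1])
        pos = end
    kept.append(seq[pos:])
    return "".join(kept)


# Intervening sequence coordinates as stated in Example 3.
KOD1_IVS = [(1374, 2453), (2708, 4316)]
IVS_LENGTHS = [end - start + 1 for start, end in KOD1_IVS]

if __name__ == "__main__":
    toy = "AAACCCGGGTTT"
    print(remove_intervening(toy, [(4, 6)]))            # toy example
    print(remove_intervening(toy, [(4, 6), (10, 12)]))  # two ranges removed
    print(IVS_LENGTHS)
```

The three retained stretches correspond to the three fragments amplified separately in the Example; in the sketch they are simply concatenated.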

Further, said gene was subcloned using an expression vector which can be induced by T7 promoter, an 
NcoI/BamHI site of pET-8c and the previously-created restriction enzyme site to give a recombinant expression vector
(pET-pol). 

Example 4. 

Expression and Purification of DNA Polymerase Derived from KOD1

Escherichia coli (BL21 (DE3)) was transformed using a recombinant expression vector (pET-pol) obtained in Example 3,
the resulting transformant was cultured in a TB medium (mentioned in Molecular Cloning, p.A.2, 1989) and, one
hour before collecting the bacterial cells, an induction treatment of T7 promoter was conducted by addition of
isopropylthio-β-D-galactopyranoside. Bacterial cells were recovered from the cultured liquid by means of centrifugation.
They were resuspended in a buffer and disintegrated by an ultrasonic treatment to give a cell extract. In order to remove
the impure protein derived from the host cells, the disintegrated cell solution was treated at 94°C for 20 minutes whereby
the impure protein derived from the host cells was isolated by centrifugation to give a thermostable DNA polymerase
derived from strain KOD1.

The Escherichia coli BL21 (DE3) pET-pol was deposited on April 22, 1996 under the Budapest Treaty at National
Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology (1-3, Higashi 1-chome,
Tsukuba-shi, Ibaraki-ken 305, JAPAN) under the accession number FERM BP-5513.

Example 5.

Purification of Thermostable DNA Polymerase Derived from KOD1 

Molecular weight of the thermostable DNA polymerase derived from KOD1 obtained in Example 4 was calculated
by means of an SDS-PAGE method whereby it was found to be about 86-92 kDa (Fig. 5). Further, a PCR was conducted
using the thermostable DNA polymerase derived from KOD1 obtained in Example 4 and a known template and
primers whereupon the target DNA fragment was confirmed (Fig. 6) in the same manner as in the case
where the thermostable DNA polymerase derived from Thermococcus litoralis was used, and a high thermostable DNA
polymerase activity was confirmed.

Comparative Example 1.

Comparison with the Thermostable DNA Polymerase Gene Derived from Pyrococcus furiosus or from Thermococcus
litoralis which are Thought to be Similar to the Hyperthermophilic archaeon strain KOD1 of the Present Invention


Amino acid sequences were estimated from the DNA sequences of the DNA polymerase gene derived from the
hyperthermophilic archaeon strain KOD1 of the present invention (SEQ ID No.6), the thermostable DNA polymerase
gene derived from Pyrococcus furiosus (Japanese Laid-Open Patent Publication Hei-5/328969) and the thermostable
DNA polymerase gene derived from Thermococcus litoralis (Japanese Laid-Open Patent Publication Hei-6/7160) and
were compared and investigated.

In the DNA polymerase derived from KOD1 of the present invention, there were Regions 1-5 which were the conserved
regions of α DNA polymerase of a eukaryotic type. Further, there were EXO1, 2 and 3 which were 3'->5'
exonuclease motifs at the N-terminal side. However, in each of the Region 1 and Region 2 which were the α DNA
polymerase conserved regions, there were intervening sequences IVS-A and IVS-B (refer to Fig. 7).

On the other hand, in Pfu polymerase which is a thermostable DNA polymerase derived from Pyrococcus furiosus,
there was no intervening sequence. In the case of Vent polymerase which is a thermostable DNA polymerase derived
from Thermococcus litoralis, there were the intervening sequences (IVS1 and IVS2) in the α DNA polymerase conserved
regions (Region 2 and Region 3) (see Fig. 7).

Example 6.

Measurement of DNA Extension Rate of the DNA Polymerase Derived from Hyperthermophilic archaeon strain KOD1

DNA prepared by annealing the M13mp18 DNA with M13P7 primer having a base sequence as mentioned in SEQ
ID No.2 was used as a substrate and the rate of synthesizing the DNA in a reaction buffer solution [20 mM Tris-HCl (pH
7.5 at 75°C), 10 mM KCl, 6 mM (NH4)2SO4, 2 mM MgCl2, 0.1% Triton X-100 and 10 µg/ml nuclease-free BSA] containing
the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 manufactured in Examples 1-5 was
investigated for the reaction time of 20, 40, 60, 80 and 100 seconds (Fig. 1) or 40, 60, 80 and 100 seconds (Fig. 2). The
results are given in Fig. 1 and in Fig. 2.

A part of the DNA sample during the elongation reaction was taken out for each reaction time and was added to a
reaction stopping solution (60 mM EDTA, 60 µM NaOH, 0.1% BPB and 30% glycerol) in the same amount.

The DNA samples obtained in the above process were separated and analyzed by means of an alkaline agarose 
electrophoresis and the size of the synthesized DNA was checked. 

1, 2, 3, 4 and 5 in Fig. 1 show the results of the reactions for 0.3 minute (20 seconds), 0.7 minute (40 seconds), 1
minute (60 seconds), 1.3 minutes (80 seconds) and 1.7 minutes (100 seconds), respectively. It is apparent from Fig. 1
that the DNA extension rate of the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 was 105
bases/second.

1, 2, 3 and 4 in Fig. 2 show the results of the reaction for 0.7 minute (40 seconds), 1 minute (60 seconds), 1.3
minutes (80 seconds) and 1.7 minutes (100 seconds), respectively. It is apparent from Fig. 2 that the DNA extension rate
of the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 was 138 bases/second.

On the other hand, the DNA synthesizing rate of each of Pfu polymerase (Stratagene), Deep Vent polymerase (New
England Biolabs) and Taq polymerase (Takara Shuzo) was measured by the same manner in each of the buffers therefor
(Fig. 2a and Fig. 2b). The DNA extension rates of those DNA polymerases were 24.8 bases/second for Pfu polymerase,
23.2 bases/second for Deep Vent polymerase and 61.0 bases/second for Taq polymerase.

From the above results, it was suggested that the DNA extension rate of the DNA polymerase derived from the 
hyperthermophilic archaeon strain KOD1 was about six-fold of those of Pfu polymerase and Deep Vent polymerase and 
about two-fold of that of Taq polymerase. 
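
The fold differences quoted above can be recomputed from the stated rates. The sketch below assumes the 138 bases/second figure from Fig. 2 is the one to compare against, since the other enzymes were measured alongside it; the dictionary and function names are hypothetical.

```python
# Illustrative only: ratio of the KOD polymerase extension rate to the others,
# using the values (bases/second) stated in Example 6.

RATES = {
    "KOD": 138.0,       # Fig. 2 value (Fig. 1 gave 105 bases/second)
    "Pfu": 24.8,
    "Deep Vent": 23.2,
    "Taq": 61.0,
}


def fold_over(reference: str, rates: dict[str, float]) -> dict[str, float]:
    """Ratio of the reference enzyme's rate to every other enzyme's rate."""
    ref = rates[reference]
    return {name: ref / rate for name, rate in rates.items() if name != reference}


FOLDS = fold_over("KOD", RATES)

if __name__ == "__main__":
    for name, fold in FOLDS.items():
        print(f"KOD / {name}: {fold:.1f}-fold")
```

With these inputs the ratios come out near 5.6 (Pfu), 5.9 (Deep Vent) and 2.3 (Taq), in line with the "about six-fold" and "about two-fold" wording.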

Example 7.

Measurement of Fidelity of the DNA Polymerase Derived from the Hyperthermophilic archaeon strain KOD1 in the
Reaction for the Synthesis of DNA

The error rate in the DNA synthesis was measured by the method of Kunkel (Kunkel, 1985, Journal of Biological
Chemistry, 260, 5787-5796). In this method, a DNA synthesis reaction was conducted with the DNA polymerase derived
from the hyperthermophilic archaeon strain KOD1 manufactured in Examples 1-5, using as a substrate an M13mp18 DNA
having a gap at a lacZ part containing a part of the genes coding for β-galactosidase. The resulting M13mp18 DNA in
which the lacZ part was double-stranded was transfected to E. coli JM109 in an NZY medium containing
5-bromo-4-chloro-3-indolyl-β-D-galactoside and isopropyl-thio-β-D-galactoside.

When β-galactosidase wherein a function is lost or lowered is expressed due to a reading error or a frame shift
during the synthetic reaction of DNA, it is not possible to utilize 5-bromo-4-chloro-3-indolyl-β-D-galactoside whereupon
the plaque becomes colorless or light blue. On the other hand, when there is no error in the synthesized DNA
and a complete β-galactosidase is expressed, the plaque becomes blue. The rate of induction of error in the DNA
synthesis was measured from the ratio of the sum of colorless and light blue plaques to the total plaques.

The rate of induction of error in the DNA synthesis was also measured for Pfu polymerase (Stratagene), Taq
polymerase (Takara Shuzo) and delta Tth polymerase (Toyobo) which were made to react by the same manner.

Further, the rate of induction of error in the DNA synthesis was also measured for a mixture of Taq polymerase and 
Pfu polymerase. The results are given in Table 1. 

Table 1

Measurement of Fidelity in the Reaction of DNA Synthesis of DNA Polymerase Derived
from Hyperthermophilic archaeon strain KOD1

Enzyme           Light Blue   White   Mutant   Total   Mutant Frequency (10^-4)
KOD1 pol.            12         11      23      6619          37.7
Pfu                  15         15      30      7691          39.0
Taq                  30         24      54      4141         130
ΔTth                 70         45     115      7375         156
Taq/Pfu (20:1)       10         20      30      4238          63.7
Taq/Pfu (50:1)       10         13      23      4489          53.5


It is apparent from Table 1 that the fidelity of the DNA polymerase derived from hyperthermophilic archaeon strain
KOD1 in the DNA synthesis reaction is superior to that of Taq polymerase and about the same as that of Pfu polymerase.
In addition, a mixture of Taq polymerase and Pfu polymerase exhibits an intermediate fidelity which is superior to that of
Taq polymerase and inferior to that of Pfu polymerase.
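
The mutant frequency column of Table 1 is described as the ratio of mutant (light blue plus white) plaques to total plaques. A minimal sketch of that arithmetic follows, assuming the frequency is expressed in units of 10^-4; the function and variable names are hypothetical. Recomputing from the counts as they appear in this text reproduces the printed values for the Pfu, Taq and ΔTth rows; for the remaining rows the recomputed figure differs somewhat from the printed one, which may reflect errors in reproduction of the table, so the sketch should not be read as a correction of the table.

```python
# Illustrative only: mutant frequency = (light blue + white) / total plaques,
# expressed in units of 1e-4. Counts are those printed in Table 1.

TABLE_1 = {
    # enzyme: (light_blue, white, total)
    "KOD1 pol.": (12, 11, 6619),
    "Pfu": (15, 15, 7691),
    "Taq": (30, 24, 4141),
    "dTth": (70, 45, 7375),          # delta Tth polymerase
    "Taq/Pfu (20:1)": (10, 20, 4238),
    "Taq/Pfu (50:1)": (10, 13, 4489),
}


def mutant_frequency(light_blue: int, white: int, total: int) -> float:
    """Mutant plaques per 10,000 plaques (i.e. frequency x 1e4)."""
    return (light_blue + white) / total * 1e4


FREQUENCIES = {name: mutant_frequency(*counts) for name, counts in TABLE_1.items()}

if __name__ == "__main__":
    for name, freq in FREQUENCIES.items():
        print(f"{name:15s} {freq:6.1f} x 10^-4")
```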

Example 8. 

Comparison in PCR of Various Thermostable DNA Polymerases by the Difference in the Reaction Time

lambda-DNA (3 ng) was used as a target nucleic acid; oligonucleotides having a sequence as mentioned in SEQ
ID Nos. 3 and 4 were used as primers; and a buffer containing 20 mM Tris-HCl (pH 7.5 at 75°C), 10 mM KCl, 6 mM
(NH4)2SO4, 2 mM MgCl2, 0.1% Triton X-100, 10 µg/ml BSA and 200 µM dNTPs was used as a buffer. DNA polymerase
derived from hyperthermophilic archaeon strain KOD1 (KOD polymerase), Taq polymerase which is widely used for
PCR and Pfu polymerase which exhibits 3'-5' exonuclease activity were used as the thermostable DNA polymerases.
The used titer of each polymerase was 2 units.

A PCR amplification reaction was conducted using a DNA Thermal Cycler (Perkin-Elmer) in a schedule wherein a
cycle comprising 94°C/20 seconds and 68°C/x seconds (x: reaction time) was repeated 30 times. In the case of the
DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 (KOD polymerase), amplification of the
target DNA was confirmed by conducting 30 cycles of 94°C/20 seconds-68°C/1 second while, in the case of Taq
polymerase, amplification of DNA was first confirmed by conducting 30 cycles of 94°C/20 seconds-68°C/10 seconds. In
the case of Pfu polymerase, amplification of DNA was confirmed only by conducting 30 cycles of 94°C/20 seconds-
68°C/1 minute. The results are given in Fig. 3.
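
To make the practical difference between these schedules concrete, the programmed hold time of each can be added up. The sketch below counts only the 94°C and 68°C holds stated above and ignores ramping between temperatures, which the text does not specify; the function name is hypothetical.

```python
# Illustrative only: total programmed hold time for the two-step cycles of
# Example 8 (94 C for 20 s, then 68 C for x s), ramp times not included.

def total_hold_seconds(cycles: int, denature_s: float, extend_s: float) -> float:
    """Sum of the programmed hold times over all cycles."""
    return cycles * (denature_s + extend_s)


SCHEDULES = {
    "KOD (x = 1 s)": total_hold_seconds(30, 20, 1),
    "Taq (x = 10 s)": total_hold_seconds(30, 20, 10),
    "Pfu (x = 60 s)": total_hold_seconds(30, 20, 60),
}

if __name__ == "__main__":
    for name, seconds in SCHEDULES.items():
        print(f"{name}: {seconds:.0f} s ({seconds / 60:.1f} min)")
```

Under these assumptions the holds amount to 10.5, 15 and 40 minutes respectively.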

In the present invention, it is possible to amplify the DNA with a high fidelity within a short reaction time when a DNA
polymerase derived from hyperthermophilic archaeon strain KOD1, which is a thermostable DNA polymerase having at
least 30 bases/second of DNA extension rate and having a 3'-5' exonuclease activity, is used. When this method is
carried out, it is possible to improve the simplicity and convenience. In addition, when such kind of thermostable DNA
polymerase having both a high extension rate (at least 30 bases/second), which has not been available yet, and 3'-5'
exonuclease activity is used, it is possible to shorten the time for the primer extension reaction and to amplify a
relatively big product with a high fidelity.



SEQUENCE LISTING 

SEQ ID No. 1

Length: 774 amino acids
Type: amino acid 
Topology: linear 
Type: protein 
Sequence Description: 

Met Ile Leu Asp Thr Asp Tyr Ile Thr Glu Asp Gly Lys Pro Val Ile
1               5                   10                  15

Arg Ile Phe Lys Lys Glu Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg
            20                  25                  30

Thr Phe Glu Pro Tyr Phe Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile
        35                  40                  45

Glu Glu Val Lys Lys Ile Thr Ala Glu Arg His Gly Thr Val Val Thr
    50                  55                  60

Val Lys Arg Val Glu Lys Val Gln Lys Lys Phe Leu Gly Arg Pro Val
65                  70                  75                  80

Glu Val Trp Lys Leu Tyr Phe Thr His Pro Gln Asp Val Pro Ala Ile
                85                  90                  95

Arg Asp Lys Ile Arg Glu His Gly Ala Val Ile Asp Ile Tyr Glu Tyr
            100                 105                 110

Asp Ile Pro Phe Ala Lys Arg Tyr Leu Ile Asp Lys Gly Leu Val Pro
        115                 120                 125

Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe Asp Ile Gln Thr
    130                 135                 140

Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro Ile Leu Met Ile
145                 150                 155                 160

Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val Ile Thr Trp Lys Asn Val
                165                 170                 175

Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg Glu Met Ile Lys
            180                 185                 190

Arg Phe Leu Arg Val Val Lys Glu Lys Asp Pro Asp Val Leu Ile Thr
        195                 200                 205

Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu
    210                 215                 220

Lys Leu Gly Ile Asn Phe Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys
225                 230                 235                 240

Ile Gln Arg Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Arg Ile
                245                 250                 255

His Phe Asp Leu Tyr Pro Val Ile Arg Arg Thr Ile Asn Leu Pro Thr
            260                 265                 270

Tyr Thr Leu Glu Ala Val Tyr Glu Ala Val Phe Gly Gln Pro Lys Glu
        275                 280                 285

Lys Val Tyr Ala Glu Glu Ile Thr Pro Ala Trp Glu Thr Gly Glu Asn
    290                 295                 300

Leu Glu Arg Val Ala Arg Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr
305                 310                 315                 320

Glu Leu Gly Lys Glu Phe Leu Pro Met Glu Ala Gln Leu Ser Arg Leu
                325                 330                 335

Ile Gly Gln Ser Leu Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu
            340                 345                 350

Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala
        355                 360                 365

Pro Asn Lys Pro Asp Glu Lys Glu Leu Ala Arg Arg Arg Gln Ser Tyr
    370                 375                 380

Glu Gly Gly Tyr Val Lys Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile
385                 390                 395                 400

Val Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser Ile Ile Ile Thr His
                405                 410                 415

Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Lys Glu Tyr Asp
            420                 425                 430

Val Ala Pro Gln Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly Phe
        435                 440                 445

Ile Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gln Lys Ile Lys
    450                 455                 460

Lys Lys Met Lys Ala Thr Ile Asp Pro Ile Glu Arg Lys Leu Leu Asp
465                 470                 475                 480

Tyr Arg Gln Arg Ala Ile Lys Ile Leu Ala Asn Ser Tyr Tyr Gly Tyr
                485                 490                 495

Tyr Gly Tyr Ala Arg Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser
            500                 505                 510

Val Thr Ala Trp Gly Arg Glu Tyr Ile Thr Met Thr Ile Lys Glu Ile
        515                 520                 525

Glu Glu Lys Tyr Gly Phe Lys Val Ile Tyr Ser Asp Thr Asp Gly Phe
    530                 535                 540

Phe Ala Thr Ile Pro Gly Ala Asp Ala Glu Thr Val Lys Lys Lys Ala
545                 550                 555                 560

Met Glu Phe Leu Asn Tyr Ile Asn Ala Lys Leu Pro Gly Ala Leu Glu
                565                 570                 575

Leu Glu Tyr Glu Gly Phe Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys
            580                 585                 590

Lys Tyr Ala Val Ile Asp Glu Glu Gly Lys Ile Thr Thr Arg Gly Leu
        595                 600                 605

Glu Ile Val Arg Arg Asp Trp Ser Glu Ile Ala Lys Glu Thr Gln Ala
    610                 615                 620

Arg Val Leu Glu Ala Leu Leu Lys Asp Gly Asp Val Glu Lys Ala Val
625                 630                 635                 640

Arg Ile Val Lys Glu Val Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro
                645                 650                 655

Pro Glu Lys Leu Val Ile His Glu Gln Ile Thr Arg Asp Leu Lys Asp
            660                 665                 670

Tyr Lys Ala Thr Gly Pro His Val Ala Val Ala Lys Arg Leu Ala Ala
        675                 680                 685

Arg Gly Val Lys Ile Arg Pro Gly Thr Val Ile Ser Tyr Ile Val Leu
    690                 695                 700

Lys Gly Ser Gly Arg Ile Gly Asp Arg Ala Ile Pro Phe Asp Glu Phe
705                 710                 715                 720

Asp Pro Thr Lys His Lys Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn Gln
                725                 730                 735

Val Leu Pro Ala Val Glu Arg Ile Leu Arg Ala Phe Gly Tyr Arg Lys
            740                 745                 750

Glu Asp Leu Arg Tyr Gln Lys Thr Arg Gln Val Gly Leu Ser Ala Trp
        755                 760                 765

Leu Lys Pro Lys Gly Thr
    770

SEQ ID No. 2

Length: 24 base pairs 
Type: nucleic acid 
Topology: linear 
Type: synthetic DNA 
Sequence Description: 

CGCCAGGGTT TTCCCAGTCA CGAC

SEQ ID No. 3 

Length: 20 base pairs 

Type: nucleic acid 

Topology: linear 

Molecular Type: synthetic DNA 

Sequence Description: 

GGGCGGCGAC CTCGCGGGTT 
SEQ ID No. 4 

Length: 24 base pairs 

Type: nucleic acid 

Topology: linear 

Molecular Type: synthetic DNA 

Sequence Description: 

GCCCATAATA ATCTGCCGGT CAAT 
SEQ ID No. 5 

Length: 5342 base pairs 

Type: nucleic acid (DNA) 

Strandedness : double 

Topology: linear 

Molecular Type: cDNA 

Source: hyperthermophilic archaeon 
Strain name: KOD1

Characteristics:
156-5165 P CDS
1374-2453 intervening sequence
2708-4316 intervening sequence
Sequence Description: 

GCTTGAGCGC CTGCGGTTAT GGGACGTTGC AGTTTGCGCC TACTCAAAGA TGCCGGTTTT  60
ATAACGGAGA AAAATGGGGA GCTATTACGA TCTCTCCTTG ATGTGGGGTT TACAATAAAG  120
CCTGGATTGT TCTACAAGAT TATGGGGGAT GAAAG ATG ATC CTC GAC ACT GAC  173
                                       Met Ile Leu Asp Thr Asp
                                       1               5

TAC ATA ACC GAG GAT GGA AAG CCT GTC ATA AGA ATT TTC AAG AAG GAA  221
Tyr Ile Thr Glu Asp Gly Lys Pro Val Ile Arg Ile Phe Lys Lys Glu
            10                  15                  20

AAC GGC GAG TTT AAG ATT GAG TAC GAC CGG ACT TTT GAA CCC TAC TTC  269
Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg Thr Phe Glu Pro Tyr Phe
        25                  30                  35

TAC GCC CTC CTG AAG GAC GAT TCT GCC ATT GAG GAA GTC AAG AAG ATA  317
Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile Glu Glu Val Lys Lys Ile
    40                  45                  50

ACC GCC GAG AGG CAC GGG ACG GTT GTA ACG GTT AAG CGG GTT GAA AAG  365
Thr Ala Glu Arg His Gly Thr Val Val Thr Val Lys Arg Val Glu Lys
55                  60                  65                  70

GTT CAG AAG AAG TTC CTC GGG AGA CCA GTT GAG GTC TGG AAA CTC TAC  413
Val Gln Lys Lys Phe Leu Gly Arg Pro Val Glu Val Trp Lys Leu Tyr
                75                  80                  85

TTT ACT CAT CCG CAG GAC GTC CCA GCG ATA AGG GAC AAG ATA CGA GAG  461
Phe Thr His Pro Gln Asp Val Pro Ala Ile Arg Asp Lys Ile Arg Glu
            90                  95                  100

CAT GGA GCA GTT ATT CAC ATC TAC GAG TAC GAC ATA CCC TTC GCC AAG  509
His Gly Ala Val Ile Asp Ile Tyr Glu Tyr Asp Ile Pro Phe Ala Lys
        105                 110                 115

CGC TAC CTC ATA GAC AAG GGA TTA GTG CCA ATG GAA GGC GAC GAG GAG  557
Arg Tyr Leu Ile Asp Lys Gly Leu Val Pro Met Glu Gly Asp Glu Glu
    120                 125                 130

CTG AAA ATG CTC GCC TTC GAC ATT CAA ACT CTC TAC CAT GAG GGC GAG  605
Leu Lys Met Leu Ala Phe Asp Ile Gln Thr Leu Tyr His Glu Gly Glu
135                 140                 145                 150

GAG TTC GCC GAG GGG CCA ATC CTT ATG ATA AGC TAC GCC GAC GAG GAA  653
Glu Phe Ala Glu Gly Pro Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu
                155                 160                 165

GGG GCC AGG GTG ATA ACT TGG AAG AAC GTG GAT CTC CCC TAC GTT GAC  701
Gly Ala Arg Val Ile Thr Trp Lys Asn Val Asp Leu Pro Tyr Val Asp
            170                 175                 180

GTC GTC TCG ACG GAG AGG GAG ATG ATA AAG CGC TTC CTC CGT GTT GTG  749
Val Val Ser Thr Glu Arg Glu Met Ile Lys Arg Phe Leu Arg Val Val
        185                 190                 195

AAG GAG AAA GAC CCG GAC GTT CTC ATA ACC TAC AAC GGC GAC AAC TTC  797
Lys Glu Lys Asp Pro Asp Val Leu Ile Thr Tyr Asn Gly Asp Asn Phe
    200                 205                 210

GAC TTC GCC TAT CTG AAA AAG CGC TGT GAA AAG CTC GGA ATA AAC TTC  845
Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu Lys Leu Gly Ile Asn Phe
215                 220                 225                 230

GCC CTC GGA AGG GAT GGA AGC GAG CCG AAG ATT CAG AGG ATG GGC GAC  893
Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys Ile Gln Arg Met Gly Asp
                235                 240                 245

AGG TTT GCC GTC GAA GTG AAG GGA CGG ATA CAC TTC GAT CTC TAT CCT  941
Arg Phe Ala Val Glu Val Lys Gly Arg Ile His Phe Asp Leu Tyr Pro
            250                 255                 260

GTG ATA AGA CGG ACG ATA AAC CTG CCC ACA TAC ACG CTT GAG GCC GTT  989
Val Ile Arg Arg Thr Ile Asn Leu Pro Thr Tyr Thr Leu Glu Ala Val
        265                 270                 275

TAT GAA GCC GTC TTC GGT CAG CCG AAG GAG AAG GTT TAC GCT GAG GAA  1037
Tyr Glu Ala Val Phe Gly Gln Pro Lys Glu Lys Val Tyr Ala Glu Glu
    280                 285                 290

ATA ACA CCA GCC TGG GAA ACC GGC GAG AAC CTT GAG AGA GTC GCC CGC  1085
Ile Thr Pro Ala Trp Glu Thr Gly Glu Asn Leu Glu Arg Val Ala Arg
295                 300                 305                 310

TAC TCG ATG GAA GAT GCG AAG GTC ACA TAC GAG CTT GGG AAG GAG TTC  1133
Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr Glu Leu Gly Lys Glu Phe
                315                 320                 325

CTT CCG ATG GAG GCC CAG CTT TCT CGC TTA ATC GGC CAG TCC CTC TGG  1181
Leu Pro Met Glu Ala Gln Leu Ser Arg Leu Ile Gly Gln Ser Leu Trp
            330                 335                 340

GAC GTC TCC CGC TCC AGC ACT GGC AAC CTC GTT GAG TGG TTC CTC CTC  1229
Asp Val Ser Arg Ser Ser Thr Gly Asn Leu Val Glu Trp Phe Leu Leu
        345                 350                 355

AGG AAG GCC TAT GAG AGG AAT GAG CTG GCC CCG AAC AAG CCC GAT GAA  1277
Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala Pro Asn Lys Pro Asp Glu
    360                 365                 370

AAG GAG CTG GCC AGA AGA CGG CAG AGC TAT GAA GGA GGC TAT GTA AAA  1325
Lys Glu Leu Ala Arg Arg Arg Gln Ser Tyr Glu Gly Gly Tyr Val Lys
375                 380                 385                 390

GAG CCC GAG AGA GGG TTG TGG GAG AAC ATA GTG TAC CTA GAT TTT AGA  1373
Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile Val Tyr Leu Asp Phe Arg
                395                 400                 405

TGC CAT CCA GCC GAT ACG AAG GTT GTC GTC AAG GGG AAG GGG ATT ATA  1421
Cys His Pro Ala Asp Thr Lys Val Val Val Lys Gly Lys Gly Ile Ile
            410                 415                 420

AAC ATC AGC GAG GTT CAG GAA GGT GAC TAT GTC CTT GGG ATT GAC GGC  1469
Asn Ile Ser Glu Val Gln Glu Gly Asp Tyr Val Leu Gly Ile Asp Gly
        425                 430                 435

TGG CAG AGA GTT AGA AAA GTA TGG GAA TAC GAC TAC AAA GGG GAG CTT  1517
Trp Gln Arg Val Arg Lys Val Trp Glu Tyr Asp Tyr Lys Gly Glu Leu
    440                 445                 450

GTA AAC ATA AAC GGG TTA AAG TGT ACG CCC AAT CAT AAG CTT CCC GTT  1565
Val Asn Ile Asn Gly Leu Lys Cys Thr Pro Asn His Lys Leu Pro Val
455                 460                 465                 470

GTT ACA AAG AAC GAA CGA CAA ACG AGA ATA AGA GAC ACT CTT GCT AAG  1613
Val Thr Lys Asn Glu Arg Gln Thr Arg Ile Arg Asp Ser Leu Ala Lys
                475                 480                 485

TCT TTC CTT ACT AAA AAA GTT AAG GGC AAG ATA ATA ACC ACT CCC CTT  1661
Ser Phe Leu Thr Lys Lys Val Lys Gly Lys Ile Ile Thr Thr Pro Leu
            490                 495                 500

TTC TAT GAA ATA GGC AGA GCG ACA ACT GAG AAT ATT CCA GAA GAA GAG  1709
Phe Tyr Glu Ile Gly Arg Ala Thr Ser Glu Asn Ile Pro Glu Glu Glu
        505                 510                 515

GTT CTC AAG GGA GAG CTC GCT GGC ATA CTA TTG GCT GAA GGA ACG CTC  1757
Val Leu Lys Gly Glu Leu Ala Gly Ile Leu Leu Ala Glu Gly Thr Leu
    520                 525                 530

TTG AGG AAA GAC GTT GAA TAC TTT GAT TCA TCC CGC AAA AAA CGG AGG  1805
Leu Arg Lys Asp Val Glu Tyr Phe Asp Ser Ser Arg Lys Lys Arg Arg
535                 540                 545                 550

ATT TCA CAC CAG TAT CGT GTT GAG ATA ACC ATT GGG AAA GAC GAG GAG  1853
Ile Ser His Gln Tyr Arg Val Glu Ile Thr Ile Gly Lys Asp Glu Glu
                555                 560                 565

GAG TTT AGG GAT CGT ATC ACA TAC ATT TTT GAG CGT TTG TTT GGG ATT  1901
Glu Phe Arg Asp Arg Ile Thr Tyr Ile Phe Glu Arg Leu Phe Gly Ile
            570                 575                 580

ACT CCA AGC ATC TCG GAG AAG AAA GGA ACT AAC GCA GTA ACA CTC AAA  1949
Thr Pro Ser Ile Ser Glu Lys Lys Gly Thr Asn Ala Val Thr Leu Lys
        585                 590                 595

GTT GCG AAG AAG AAT GTT TAT CTT AAA GTC AAG GAA ATT ATG GAC AAC  1997
Val Ala Lys Lys Asn Val Tyr Leu Lys Val Lys Glu Ile Met Asp Asn
    600                 605                 610

ATA GAG TCC CTA CAT GCC CCC TCG GTT CTC AGG GGA TTC TTC GAA GGC  2045
Ile Glu Ser Leu His Ala Pro Ser Val Leu Arg Gly Phe Phe Glu Gly
615                 620                 625                 630

GAC GGT TCA GTA AAC AGG GTT AGG AGG AGT ATT GTT GCA ACC CAG GGT  2093
Asp Gly Ser Val Asn Arg Val Arg Arg Ser Ile Val Ala Thr Gln Gly
                635                 640                 645

ACA AAG AAC GAG TGG AAG ATT AAA CTG GTC TCA AAA CTG CTC TCC CAG  2141
Thr Lys Asn Glu Trp Lys Ile Lys Leu Val Ser Lys Leu Leu Ser Gln
            650                 655                 660

CTT GGT ATC CCT CAT CAA ACG TAC ACG TAT CAG TAT CAG GAA AAT GGG  2189
Leu Gly Ile Pro His Gln Thr Tyr Thr Tyr Gln Tyr Gln Glu Asn Gly
        665                 670                 675

AAA GAT CGG AGC AGG TAT ATA CTG GAG ATA ACT GGA AAG GAC GGA TTC  2237
Lys Asp Arg Ser Arg Tyr Ile Leu Glu Ile Thr Gly Lys Asp Gly Leu
    680                 685                 690

ATA CTG TTC CAA ACA CTC ATT GGA TTC ATC AGT GAA AGA AAG AAC GCT  2285
Ile Leu Phe Gln Thr Leu Ile Gly Phe Ile Ser Glu Arg Lys Asn Ala
695                 700                 705                 710

CTG CTT AAT AAG GCA ATA TCT CAG AGG GAA ATG AAC AAC TTG GAA AAC  2333
Leu Leu Asn Lys Ala Ile Ser Gln Arg Glu Met Asn Asn Leu Glu Asn
                715                 720                 725

AAT GGA TTT TAC AGG CTC AGT GAA TTC AAT GTC AGC ACG GAA TAC TAT  2381
Asn Gly Phe Tyr Arg Leu Ser Glu Phe Asn Val Ser Thr Glu Tyr Tyr
            730                 735                 740

GAG GGC AAG GTC TAT GAC TTA ACT CTT GAA GGA ACT CCC TAC TAC TTT  2429
Glu Gly Lys Val Tyr Asp Leu Thr Leu Glu Gly Thr Pro Tyr Tyr Phe
        745                 750                 755

GCC AAT GGC ATA TTG ACC CAT AAC TCC CTG TAC CCC TCA ATC ATC ATC  2477
Ala Asn Gly Ile Leu Thr His Asn Ser Leu Tyr Pro Ser Ile Ile Ile
    760                 765                 770

ACC CAC AAC GTC TCG CCG GAT ACG CTC AAC AGA GAA GGA TGC AAG GAA  2525
Thr His Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Lys Glu
775                 780                 785                 790

TAT GAC GTT GCC CCA CAG GTC GGC CAC CGC TTC TGC AAG GAC TTC CCA  2573
Tyr Asp Val Ala Pro Gln Val Gly His Arg Phe Cys Lys Asp Phe Pro
                795                 800                 805

GGA TTT ATC CCG AGC CTG CTT GGA GAC CTC CTA GAG GAG AGG CAG AAG  2621
Gly Phe Ile Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gln Lys
            810                 815                 820

ATA AAG AAG AAG ATG AAG GCC ACG ATT GAC CCG ATC GAG AGG AAG CTC  2669
Ile Lys Lys Lys Met Lys Ala Thr Ile Asp Pro Ile Glu Arg Lys Leu
        825                 830                 835

CTC GAT TAC AGG CAG AGG GCC ATC AAG ATC CTG GCA AAC AGC ATC CTA  2717
Leu Asp Tyr Arg Gln Arg Ala Ile Lys Ile Leu Ala Asn Ser Ile Leu
    840                 845                 850

CCC GAG GAA TGG CTT CCA GTC CTC GAG GAA GGG GAG GTT CAC TTC GTC  2765
Pro Glu Glu Trp Leu Pro Val Leu Glu Glu Gly Glu Val His Phe Val
855                 860                 865                 870

AGG ATT GGA GAG CTC ATA GAC CGG ATG ATG GAG GAA AAT GCT GGG AAA  2813
Arg Ile Gly Glu Leu Ile Asp Arg Met Met Glu Glu Asn Ala Gly Lys
                875                 880                 885

GTA AAG AGA GAG GGC GAG ACG GAA CTG CTT GAG GTC ACT GGG CTT GAA  2861
Val Lys Arg Glu Gly Glu Thr Glu Val Leu Glu Val Ser Gly Leu Glu
            890                 895                 900

GTC CCG TCC TTT AAC AGG AGA ACT AAC AAG GCC GAG CTC AAG AGA GTA  2909
Val Pro Ser Phe Asn Arg Arg Thr Asn Lys Ala Glu Leu Lys Arg Val
        905                 910                 915

AAG GCC CTG ATT AGG CAC GAT TAT TCT GGC AAG GTC TAC ACC ATC AGA  2957
Lys Ala Leu Ile Arg His Asp Tyr Ser Gly Lys Val Tyr Thr Ile Arg
    920                 925                 930

CTG AAG TCG GGG AGG AGA ATA AAG ATA ACC TCT GGC CAC AGC CTC TTC  3005
Leu Lys Ser Gly Arg Arg Ile Lys Ile Thr Ser Gly His Ser Leu Phe
935                 940                 945                 950

TCT GTG AGA AAC GGG GAG CTC GTT GAA GTT ACG GGC GAT GAA CTA AAG  3053
Ser Val Arg Asn Gly Glu Leu Val Glu Val Thr Gly Asp Glu Leu Lys
                955                 960                 965

CCA GGT GAC CTC GTT GCA GTC CCG CGG AGA TTG GAG CTT CCT GAG AGA  3101
Pro Gly Asp Leu Val Ala Val Pro Arg Arg Leu Glu Leu Pro Glu Arg
            970                 975                 980

AAC CAC GTG CTG AAC CTC GTT GAA CTG CTC CTT GGA ACG CCA GAA GAA  3149
Asn His Val Leu Asn Leu Val Glu Leu Leu Leu Gly Thr Pro Glu Glu
        985                 990                 995

GAA ACT TTG GAC ATC GTC ATG ACG ATC CCA GTC AAG GGT AAG AAG AAC  3197
Glu Thr Leu Asp Ile Val Met Thr Ile Pro Val Lys Gly Lys Lys Asn
    1000                1005                1010

TTC TTT AAA GGG ATG CTC AGG ACT TTG CGC TGG ATT TTC GGA GAG GAA  3245
Phe Phe Lys Gly Met Leu Arg Thr Leu Arg Trp Ile Phe Gly Glu Glu
1015                1020                1025                1030

AAG AGG CCC AGA ACC GCG AGA CGC TAT CTC AGG CAC CTT GAG GAT CTG  3293
Lys Arg Pro Arg Thr Ala Arg Arg Tyr Leu Arg His Leu Glu Asp Leu
                1035                1040                1045

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GGC TAT GTC CGG CTT AAG AAG ATC GGC TAC GAA GTC CTC GAC TGG GAC 3341 
Gly Tyr Val Arg Leu Lys Lys He Gly Tyr Glu Vai Leu Asp Trp Asp 

1050 1055 1060 

TCA CTT AAG AAC TAC AGA AGG CTC TAC GAG GCG CTT GTC GAG AAC GTC 3389
Ser Leu Lys Asn Tyr Arg Arg Leu Tyr Glu Ala Leu Val Glu Asn Val
1065 1070 1075

AGA TAC AAC GGC AAC AAG AGG GAG TAC CTC GTT GAA TTC AAT TCC ATC 3437
Arg Tyr Asn Gly Asn Lys Arg Glu Tyr Leu Val Glu Phe Asn Ser Ile
1080 1085 1090

CGG GAT GCA GTT GGC ATA ATG CCC CTA AAA GAG CTG AAG GAG TGG AAG 3485
Arg Asp Ala Val Gly Ile Met Pro Leu Lys Glu Leu Lys Glu Trp Lys
1095 1100 1105 1110

ATC GGC ACG CTG AAC GGC TTC AGA ATG AGA AAG CTC ATT GAA GTG GAC 3533
Ile Gly Thr Leu Asn Gly Phe Arg Met Arg Lys Leu Ile Glu Val Asp
1115 1120 1125

GAG TCG TTA GCA AAG CTC CTC GGC TAC TAC GTG AGC GAG GGC TAT GCA 3581
Glu Ser Leu Ala Lys Leu Leu Gly Tyr Tyr Val Ser Glu Gly Tyr Ala
1130 1135 1140

AGA AAG CAG AGG AAT CCC AAA AAC GGC TGG AGC TAC AGC GTG AAG CTC 3629
Arg Lys Gln Arg Asn Pro Lys Asn Gly Trp Ser Tyr Ser Val Lys Leu
1145 1150 1155

TAC AAC GAA GAC CCT GAA GTG CTG GAC GAT ATG GAG AGA CTC GCC AGC 3677
Tyr Asn Glu Asp Pro Glu Val Leu Asp Asp Met Glu Arg Leu Ala Ser
1160 1165 1170

AGG TTT TTC GGG AAG GTG AGG CGG GGC AGG AAC TAC GTT GAG ATA CCG 3725
Arg Phe Phe Gly Lys Val Arg Arg Gly Arg Asn Tyr Val Glu Ile Pro
1175 1180 1185 1190

AAG AAG ATC GGC TAC CTG CTC TTT GAG AAC ATG TGC GGT GTC CTA GCG 3773
Lys Lys Ile Gly Tyr Leu Leu Phe Glu Asn Met Cys Gly Val Leu Ala
1195 1200 1205

GAG AAC AAG AGG ATT CCC GAG TTC GTC TTC ACG TCC CCG AAA GGG GTT 3821
Glu Asn Lys Arg Ile Pro Glu Phe Val Phe Thr Ser Pro Lys Gly Val
1210 1215 1220

CGG CTG GCC TTC CTT GAG GGG TAC TCA TCG GCG ATG GCG ACG TCC ACC 3869
Arg Leu Ala Phe Leu Glu Gly Tyr Ser Ser Ala Met Ala Thr Ser Thr
1225 1230 1235

GAA CAA GAG ACT CAG GCT CTC AAC GAA AAG CGA GCT TTA GCG AAC CAG 3917
Glu Gln Glu Thr Gln Ala Leu Asn Glu Lys Arg Ala Leu Ala Asn Gln
1240 1245 1250

CTC GTC CTC CTC TTG AAC TCG GTG GGG GTC TCT GCT GTA AAA CTT GGG 3965
Leu Val Leu Leu Leu Asn Ser Val Gly Val Ser Ala Val Lys Leu Gly
1255 1260 1265 1270

CAC GAC AGC GGC GTT TAC AGG GTC TAT ATA AAC GAG GAG CTC CCG TTC 4013
His Asp Ser Gly Val Tyr Arg Val Tyr Ile Asn Glu Glu Leu Pro Phe
1275 1280 1285

GTA AAG CTG GAC AAG AAA AAG AAC GCC TAC TAC TCA CAC GTG ATC CCC 4061
Val Lys Leu Asp Lys Lys Lys Asn Ala Tyr Tyr Ser His Val Ile Pro
1290 1295 1300

AAG GAA GTC CTG AGC GAG GTC TTT GGG AAG GTT TTC CAG AAA AAC GTC 4109
Lys Glu Val Leu Ser Glu Val Phe Gly Lys Val Phe Gln Lys Asn Val
1305 1310 1315

AGT CCT CAG ACC TTC AGG AAG ATG GTC GAG GAC GGA AGA CTC GAT CCC 4157
Ser Pro Gln Thr Phe Arg Lys Met Val Glu Asp Gly Arg Leu Asp Pro
1320 1325 1330

GAA AAG GCC CAG AGG CTC TCC TGG CTC ATT GAG GGG GAC GTA GTG CTC 4205
Glu Lys Ala Gln Arg Leu Ser Trp Leu Ile Glu Gly Asp Val Val Leu
1335 1340 1345 1350

GAC CGC GTT GAG TCC GTT GAT GTG GAA GAC TAC GAT GGT TAT GTC TAT 4253
Asp Arg Val Glu Ser Val Asp Val Glu Asp Tyr Asp Gly Tyr Val Tyr
1355 1360 1365

GAC CTG AGC GTC GAG GAC AAC GAG AAC TTC CTC GTT GGC TTT GGG TTG 4301 
Asp Leu Ser Val Glu Asp Asn Glu Asn Phe Leu Val Gly Phe Gly Leu 

1370 1375 1380 

GTC TAT GCT CAC AAC AGC TAC TAC GGT TAC TAC GGC TAT GCA AGG GCG 4349 
Val Tyr Ala His Asn Ser Tyr Tyr Gly Tyr Tyr Gly Tyr Ala Arg Ala 

1385 1390 1395 

CGC TGG TAC TGC AAG GAG TGT GCA GAG AGC GTA ACG GCC TGG GGA AGG 4397 
Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser Val Thr Ala Trp Gly Arg 

1400 1405 1410 

GAG TAC ATA ACG ATG ACC ATC AAG GAG ATA GAG GAA AAG TAC GGC TTT 4445
Glu Tyr Ile Thr Met Thr Ile Lys Glu Ile Glu Glu Lys Tyr Gly Phe
1415 1420 1425 1430

AAG GTA ATC TAC AGC GAC ACC GAC GGA TTT TTT GCC ACA ATA CCT GGA 4493
Lys Val Ile Tyr Ser Asp Thr Asp Gly Phe Phe Ala Thr Ile Pro Gly
1435 1440 1445

GCC GAT GCT GAA ACC GTC AAA AAG AAG GCT ATG GAG TTC CTC AAC TAT 4541
Ala Asp Ala Glu Thr Val Lys Lys Lys Ala Met Glu Phe Leu Asn Tyr
1450 1455 1460

ATC AAC GCC AAA CTT CCG GGC GCG CTT GAG CTC GAG TAC GAG GGC TTC 4589
Ile Asn Ala Lys Leu Pro Gly Ala Leu Glu Leu Glu Tyr Glu Gly Phe
1465 1470 1475

TAC AAA CGC GGC TTC TTC GTC ACG AAG AAG AAG TAT GCG GTG ATA GAC 4637
Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala Val Ile Asp
1480 1485 1490

GAG GAA GGC AAG ATA ACA ACG CGC GGA CTT GAG ATT GTG AGG CGT GAC 4685
Glu Glu Gly Lys Ile Thr Thr Arg Gly Leu Glu Ile Val Arg Arg Asp
1495 1500 1505 1510

TGG AGC GAG ATA GCG AAA GAG ACG CAG GCG AGG GTT CTT GAA GCT TTG 4733
Trp Ser Glu Ile Ala Lys Glu Thr Gln Ala Arg Val Leu Glu Ala Leu
1515 1520 1525

CTA AAG GAC GGT GAC GTC GAG AAG GCC GTG AGG ATA GTC AAA GAA GTT 4781
Leu Lys Asp Gly Asp Val Glu Lys Ala Val Arg Ile Val Lys Glu Val
1530 1535 1540

ACC GAA AAG CTG AGC AAG TAC GAG GTT CCG CCG GAG AAG CTG GTG ATC 4829
Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro Pro Glu Lys Leu Val Ile
1545 1550 1555

CAC GAG CAG ATA ACG AGG GAT TTA AAG GAC TAC AAG GCA ACC GGT CCC 4877
His Glu Gln Ile Thr Arg Asp Leu Lys Asp Tyr Lys Ala Thr Gly Pro
1560 1565 1570

CAC GTT GCC GTT GCC AAG AGG TTG GCC GCG AGA GGA GTC AAA ATA CGC 4925
His Val Ala Val Ala Lys Arg Leu Ala Ala Arg Gly Val Lys Ile Arg
1575 1580 1585 1590

CCT GGA ACG GTG ATA AGC TAC ATC GTG CTC AAG GGC TCT GGG AGG ATA 4973
Pro Gly Thr Val Ile Ser Tyr Ile Val Leu Lys Gly Ser Gly Arg Ile
1595 1600 1605

GGC GAC AGG GCG ATA CCG TTC GAC GAG TTC GAC CCG ACG AAG CAC AAG 5021
Gly Asp Arg Ala Ile Pro Phe Asp Glu Phe Asp Pro Thr Lys His Lys
1610 1615 1620

TAC GAC GCC GAG TAC TAC ATT GAG AAC CAG GTT CTC CCA GCC GTT GAG 5069
Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn Gln Val Leu Pro Ala Val Glu
1625 1630 1635

AGA ATT CTG AGA GCC TTC GGT TAC CGC AAG GAA GAC CTG CGC TAC CAG 5117
Arg Ile Leu Arg Ala Phe Gly Tyr Arg Lys Glu Asp Leu Arg Tyr Gln
1640 1645 1650

AAG ACG AGA CAG GTT GGT TTG AGT GCT TGG CTG AAG CCG AAG GGA ACT 5165
Lys Thr Arg Gln Val Gly Leu Ser Ala Trp Leu Lys Pro Lys Gly Thr
1655 1660 1665 1670

TGACCTTTCC ATTTGTTTTC CAGCGGATAA CCCTTTAACT TCCCTTTCAA AAACTCCCT 5225 

TAGGGAAAGA CCATGAAGAT AGAAATCCGG CGGCGCCCGG TTAAATACGC TAGGATAGA 5285 

GTGAAGCCAG ACGGCAGGGT AGTCGTCACT GCCCCGAGGG TTCAACGTTG AGAAGTT 5342 

SEQ ID No. 6
Length: 5342 base pairs
Type: nucleic acid (DNA)
Strandedness: double
Topology: linear
Molecular Type: cDNA
Source: hyperthermophilic archaeon
Strain name: KOD1
Sequence Description:

GCTTGAGGGC CTGCGGTTAT GGGACGTTGC AGTTTGCGCC TACTCAAAGA TGCCGGTTTT 60
ATAACGGAGA AAAATGGGGA GCTATTACGA TCTCTCCTTG ATGTGGGGTT TACAATAAAG 120
CCTGGATTGT TCTACAAGAT TATGGGGGAT GAAAGATGAT CCTCGACACT GACTACATAA 180
CCGAGGATGG AAAGCCTGTC ATAAGAATTT TCAAGAAGGA AAACGGCGAG TTTAAGATTG 240
AGTACGACCG GACTTTTGAA CCCTACTTCT ACGCCCTCCT GAAGGACGAT TCTGCCATTG 300
AGGAAGTCAA GAAGATAACC GCCGAGAGGC ACGGGACGGT TGTAACGGTT AAGCGGGTTG 360
AAAAGGTTCA GAAGAAGTTC CTCGGGAGAC CAGTTGAGGT CTGGAAACTC TACTTTACTC 420
ATCCGCAGGA CGTCCCAGCG ATAAGGGACA AGATACGAGA GCATGGAGCA GTTATTGACA 480
TCTACGAGTA CGACATACCC TTCGCCAAGC GCTACCTCAT AGACAAGGGA TTAGTGCCAA 540
TGGAAGGCGA CGAGGAGCTG AAAATGCTCG CCTTCGACAT TCAAACTCTC TACCATGAGG 600
GCGAGGAGTT CGCCGAGGGG CCAATCCTTA TGATAAGCTA CGCCGACGAG GAAGGGGCCA 660
GGGTGATAAC TTGGAAGAAC GTGGATCTCC CCTACGTTGA CGTCGTCTCG ACGGAGAGGG 720
AGATGATAAA GCGCTTCCTC CGTGTTGTGA AGGAGAAAGA CCCGGACGTT CTCATAACCT 780
ACAACGGCGA CAACTTCGAC TTCGCCTATC TGAAAAAGCG CTGTGAAAAG CTCGGAATAA 840
ACTTCGCCCT CGGAAGGGAT GGAAGCGAGC CGAAGATTCA GAGGATGGGC GACAGGTTTG 900
CCGTCGAAGT GAAGGGACGG ATACACTTCG ATCTCTATCC TGTGATAAGA CGGACGATAA 960
ACCTGCCCAC ATACACGCTT GAGGCCGTTT ATGAAGCCGT CTTCGGTCAG CCGAAGGAGA 1020 
AGGTTTACGC TGAGGAAATA ACACCAGCCT GGGAAACCGG CGAGAACCTT GAGAGAGTCG 1080 
CCCGCTACTC GATGGAAGAT GCGAAGGTCA CATACGAGCT TGGGAACGAG TTCCTTCCGA 1140 
TGGAGGCCCA GCTTTCTCGC TTAATCGGCC AGTCCCTCTG GGACGTCTCC CGCTCCAGCA 1200 
CTGGCAACCT CGTTGAGTGG TTCCTCCTCA GGAAGGCCCT ATGAGAGGAA TGAGCTGGCC 1260 
CCCAACAAGC CCGATGAAAA GGAGCTGGCC AGAAGACGGC AGAGCTATGA AGGAGGCTAT 1320 
GTAAAAGAGC CCGAGAGAGG GTTGTGGGAG AACATAGTGT ACCTAGATTT TAGATGCCAT 1380 
CCAGCCGATA CGAAGGTTGT CGTCAAGGGG AAGGGGATTA TAAACATCAG CGAGGTTCAG 1440 
GAAGGTGACT ATGTCCTTGG GATTGACGGC TGGCAGAGAG TTAGAAAAGT ATGGGAATAC 1500 
GACTACAAAG GGGAGCTTGT AAACATAAAC GGGTTAAAGT GTACGCCCAA TCATAAGCTT 1560 
CCCGTTGTTA CAAAGAACGA ACGACAAACG AGAATAAGAG ACAGTCTTGC TAAGTCTTTC 1620 
CTTACTAAAA AAGTTAAGGG CAAGATAATA ACCACTCCCC TTTTCTATGA AATAGGCAGA 1680
GCGACAAGTG AGAATATTCC AGAAGAAGAG GTTCTCAAGG GAGAGCTCGC TGGCATAGTA 1740
TTGGCTGAAG GAACGCTCTT GAGGAAAGAC GTTGAATACT TTGATTCATC CCGCAAAAAA 1800 
CGGAGGATTT CACACCAGTA TCGTGTTGAG ATAACCATTG GGAAAGACGA GGAGGAGTTT 1860 
AGGGATCGTA TCACATACAT TTTTGAGCGT TTGTTTGGGA TTACTCCAAG CATCTCGGAG 1920 
AAGAAAGGAA CTAACGCAGT AACACTCAAA GTTGCGAAGA AGAATGTTTA TCTTAAAGTC 1980 
AAGGAAATTA TGGACAACAT AGAGTCCCTA CATGCCCCCT CGGTTCTCAG GGGATTCTTC 2040 
GAAGGCGACG GTTCAGTAAA CAGGTTAGGA GGAGTATTGT TGCAACCCAG GGTACAAAGA 2100 
ACGAGTGGAA GATTAAACTG GTGTCAAAAC TGCTCTCCCA GCTTGGTATC CCTCATCAAA 2160 
CGTACACGTA TCAGTATCAG GAAAATGGGA AAGATCGGAG CAGGTATATA CTGGAGATAA 2220 
CTGGAAAGGA CGGATTGATA CTGTTCCAAA CACTCATTGG ATTCATCAGT GAAAGAAAGA 2280 
ACGCTCTGCT TAATAAGGCA ATATCTCAGA GGGAAATGAA CAACTTGGAA AACAATGGAT 2340 
TTTACAGGCT CAGTGAATTC AATGTCAGCA CGGAATACTA TGAGGGCAAG GTCTATGACT 2400 
TAACTCTTGA AGGAACTCCC TACTTTGCCA ATGGCATATT GACCCATAAC TCCCTGTACC 2460 
CCTCAATCAT CATCACCCAC AACGTCTCGC CGGATACGCT CAACAGAGAA GGATGCAAGG 2520 
AATATGACGT TGCCCCACAG GTCGGCCACC GCTTCTGCAA GGACTTCCCA GGATTTATCC 2580 
CGAGCCTGCT TGGAGACCTC CTAGAGGAGA GGCAGAAGAT AAAGAAGAAG ATGAAGGCCA 2640
CGATTGACCC GATCGAGAGG AAGCTCCTCG ATTACAGGCA GAGGGCCATC AAGATCCTGG 2700
CAAACAGCAT CCTACCCGAG GAATGGCTTC CAGTCCTCGA GGAAGGGGAG GTTCACTTCG 2760
TCAGGATTGG AGAGCTCATA GACCGGATGA TGGAGGAAAA TGCTGGGAAA GTAAAGAGAG 2820
AGGGCGAGAC GGAAGTGCTT GAGGTCAGTG GGCTTGAAGT CCCGTCCTTT AACAGGAGAA 2880
CTAACAAGGC CGAGCTCAAG AGAGTAAAGG CCCTGATTAG GCACGATTAT TCTGGCAAGG 2940
TCTACACCAT CAGACTGAAG TCGGGGAGGA GAATAAAGAT AACCTCTGGC CACAGCCTCT 3000
TCTCTGTGAG AAACGGGGAG CTCGTTGAAG TTACGGGCGA TGAACTAAAG CCAGGTGACC 3060
TCGTTGCAGT CCCGCGGAGA TTGGAGCTTC CTGAGAGAAA CCACGTGCTG AACCTCGTTG 3120
AACTGCTCCT TGGAACGCCA GAAGAAGAAA CTTTGGACAT CGTCATGACG ATCCCAGTCA 3180
AGGGTAAGAA GAACTTCTTT AAAGGGATGC TCAGGACTTT GCGCTGGATT TTCGGAGAGG 3240
AAAAGAGGCC CAGAACCGCG AGACGCTATC TCAGGCACCT TGAGGATCTG GGCTATGTCC 3300
GGCTTAAGAA GATCGGCTAC GAAGTCCTCG ACTGGGACTC ACTTAAGAAC TACAGAAGGC 3360
TCTACGAGGC GCTTGTCGAG AACGTCAGAT ACAACGGCAA CAAGAGGGAG TACCTCGTTG 3420
AATTCAATTC CATCCGGGAT GCAGTTGGCA TAATGCCCCT AAAAGAGCTG AAGGAGTGGA 3480
AGATCGGCAC GCTGAACGGC TTCAGAATGA GAAAGCTCAT TGAAGTGGAC GAGTCGTTAG 3540
CAAAGCTCCT CGGCTACTAC GTGAGCGAGG GCTATGCAAG AAAGCAGAGG AATCCCAAAA 3600
ACGGCTGGAG CTACAGCGTG AAGCTCTACA ACGAAGACCC TGAAGTGCTG GACGATATGG 3660
AGAGACTCGC CAGCAGGTTT TTCGGGAAGG TGAGGCGGGG CAGGAACTAC GTTGAGATAC 3720
CGAAGAAGAT CGGCTACCTG CTCTTTGAGA ACATGTGCGG TGTCCTAGCG GAGAACAAGA 3780
GGATTCCCGA GTTCGTCTTC ACGTCCCCGA AAGGGGTTCG GCTGGCCTTC CTTGAGGGGT 3840
ACTCATCGGC GATGGCGACG TCCACCGAAC AAGAGACTCA GGCTCTCAAC GAAAAGCGAG 3900
CTTTAGCGAA CCAGCTCGTC CTCCTCTTGA ACTCGGTGGG GGTCTCTGCT GTAAAACTTG 3960
GGCACGACAG CGGCGTTTAC AGGGTCTATA TAAACGAGGA GCTCCCGTTC GTAAAGCTGG 4020
ACAAGAAAAA GAACGCCTAC TACTCACACG TGATCCCCAA GGAAGTCCTG AGCGAGGTCT 4080
TTGGGAAGGT TTTCCAGAAA AACGTCAGTC CTCAGACCTT CAGGAAGATG GTCGAGGACG 4140
GAAGACTCGA TCCCGAAAAG GCCCAGAGGC TCTCCTGGCT CATTGAGGGG GACGTAGTGC 4200
TCGACCGCGT TGAGTCCGTT GATGTGGAAG ACTACGATGG TTATGTCTAT GACCTGAGCG 4260
TCGAGGACAA CGAGAACTTC CTCGTTGGCT TTGGGTTGGT CTATGCTCAC AACAGCTACT 4320
ACGGTTACTA CGGCTATGCA AGGGCGCGCT GGTACTGCAA GGAGTGTGCA GAGAGCGTAA 4380
CGGCCTGGGG AAGGGAGTAC ATAACGATGA CCATCAAGGA GATAGAGGAA AAGTACGGCT 4440
TTAAGGTAAT CTACAGCGAC ACCGACGGAT TTTTTGCCAC AATACCTGGA GCCGATGCTG 4500
AAACCGTCAA AAAGAACGCT ATGGAGTTCC TCAACTATAT CAACGCCAAA CTTCCGGGCG 4560
CGCTTGAGCT CGAGTACGAG GGCTTCTACA AACGCGGCTT CTTCGTCACG AAGAAGAAGT 4620
ATGCGGTGAT AGACGAGGAA GGCAAGATAA CAACGCGCGG ACTTGAGATT GTGAGGCGTG 4680
ACTGGAGCGA GATAGCGAAA GAGACGCAGG CGAGGGTTCT TGAAGCTTTG CTAAAGGACG 4740
GTGACGTCGA GAAGGCCGTG AGGATAGTCA AAGAAGTTAC CGAAAAGCTG AGCAAGTACG 4800
AGGTTCCGCC GGAGAAGCTG GTGATCCACG AGCAGATAAC GAGGGATTTA AAGGACTACA 4860
AGGCAACCGG TCCCCACGTT GCCGTTGCCA AGAGGTTGGC CGCGAGAGGA GTCAAAATAC 4920
GCCCTGGAAC CGTGATAAGC TACATCGTGC TCAAGGGCTC TGGGAGGATA GGCGACAGGG 4980
CGATACCGTT CGACGAGTTC GACCCGACGA AGCACAAGTA CGATGCCGAG TACTACATTG 5040
AGAACCAGGT TCTCCCAGCC GTTGAGAGAA TTCTGAGAGC CTTCGGTTAC CGCAAGGAAG 5100
ACCTGCGCTA CCAGAAGACG AGACAGGTTG GTTTGAGTGC TTGGCTGAAG CCGAAGGGAA 5160
CTTGACCTTT CCATTTGTTT TCCAGCGGAT AACCCTTTAA CTTCCCTTTC AAAAACTCCC 5220
TTTAGGGAAA GACCATGAAG ATAGAAATCC GGCGGCGCCC GGTTAAATAC GCTAGGATAG 5280
AAGTGAAGCC AGACGGCAGG GTAGTCGTCA CTGCCCCGAG GGTTCAACGT TGAGAAGTT 5339



BNSDOCID: <EP 0745675A2_I_>






SEQ ID No. 7 

Length: 24 base pairs 

Type: nucleic acid 

Topology: linear

Molecular Type: synthetic DNA 

Sequence Description: 

GGATTAGTGC CAATGGAAGG CGAC 24 
SEQ ID No. 8

Length: 24 base pairs 

Type: nucleic acid 

Topology: linear 

Molecular Type: synthetic DNA 

Sequence Description: 

GAGGGCGAAG TTTATTCCGA GCTT 24 
SEQ ID No. 9 

Length: 324 base pairs 
Type: nucleic acid (DNA) 
Strandedness: double
Topology: linear 
Molecular Type: cDNA 
Sequence Description: 

GGATTAGTGC CAATGGAAGG CGACGAGGAG CTGAAAATGC TCGCCTTCGA CATTCAAACT 60 
CTCTACCATG AGGGCGAGGA GTTCGCCGAG GGGCCAATCC TTATGATAAG CTACGCCGAC 120 
GAGGAAGGGG CCAGGGTGAT AACTTGGAAG AACGTGGATC TCCCCTACGT TGACGTCGTC 180 
TCGACGGAGA GGGAGATGAT AAAGCGCTTC CTCCGTGTTG TGAAGGAGAA AGACCCGGAC 240 
GTTCTCATAA CCTACAACGG CGACAACTTC GACTTCGCCT ATCTGAAAAA GCGCTGTGAA 300 
AAGCTCGGAA TAAACTTCGC CCTC 324
SEQ ID No. 10
Length: 108 amino acids
Type: amino acid
Topology: linear
Molecular Type: protein
Sequence Description:

Gly Leu Val Pro Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe
1 5 10 15

Asp Ile Gln Thr Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro
20 25 30

Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val Ile Thr
35 40 45

Trp Lys Asn Val Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg
50 55 60

Glu Met Ile Lys Arg Phe Leu Arg Val Val Lys Glu Lys Asp Pro Asp
65 70 75 80

Val Leu Ile Thr Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys
85 90 95

Lys Arg Cys Glu Lys Leu Gly Ile Asn Phe Ala Leu
100 105
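
The relationship between SEQ ID No. 9 (324 base pairs) and SEQ ID No. 10 (108 residues) can be checked with a short script. The sketch below is not part of the original listing; it assumes the standard genetic code and that the reading frame starts at the first base of SEQ ID No. 9. The names `CODON_TABLE`, `SEQ_ID_9` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID No. 9 as printed in the listing; block spacing is removed below.
SEQ_ID_9 = (
    "GGATTAGTGC CAATGGAAGG CGACGAGGAG CTGAAAATGC TCGCCTTCGA CATTCAAACT "
    "CTCTACCATG AGGGCGAGGA GTTCGCCGAG GGGCCAATCC TTATGATAAG CTACGCCGAC "
    "GAGGAAGGGG CCAGGGTGAT AACTTGGAAG AACGTGGATC TCCCCTACGT TGACGTCGTC "
    "TCGACGGAGA GGGAGATGAT AAAGCGCTTC CTCCGTGTTG TGAAGGAGAA AGACCCGGAC "
    "GTTCTCATAA CCTACAACGG CGACAACTTC GACTTCGCCT ATCTGAAAAA GCGCTGTGAA "
    "AAGCTCGGAA TAAACTTCGC CCTC"
).replace(" ", "")


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 into one-letter amino acid codes."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


protein = translate(SEQ_ID_9)
print(len(SEQ_ID_9), "bases ->", len(protein), "residues")
print(protein)
```

Under these assumptions the output begins with GLVPMEGDEELKMLAF, which corresponds to the first line of SEQ ID No. 10 (Gly Leu Val Pro Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe).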
SEQ ID No. 11
Length: 42 base pairs
Type: nucleic acid (DNA)
Strandedness: single
Molecular Type: synthetic DNA
Sequence Description:

GCCATCAAGA TCCTGGCAAA CAGCTACTAC GGTTACTACG GC 42
SEQ ID No. 12
Length: 32 base pairs
Type: nucleic acid (DNA)
Strandedness: single
Molecular Type: synthetic DNA
Sequence Description:

GATGGATCCA ACTTCTCAAC GTTGAACCCT CG 32
SEQ ID No. 13
Length: 46 base pairs
Type: nucleic acid (DNA)
Strandedness: single
Molecular Type: synthetic DNA
Sequence Description:

GAACATAGTG TACCTAGATT TTAGATCCCT GTACCCCTCA ATCATC 46
SEQ ID No. 14
Length: 42 base pairs
Type: nucleic acid (DNA)
Strandedness: single
Molecular Type: synthetic DNA
Sequence Description:

GCCGTAGTAA CCGTAGTAGC TGTTTGCCAG GATCTTGATG GC 42



SEQ ID No. 15
Length: 33 base pairs
Type: nucleic acid (DNA)
Strandedness: single
Molecular Type: synthetic DNA
Sequence Description:

ATCGATATCC TCGACACTGA CTACATAACC GAG 33
SEQ ID No. 16
Length: 46 base pairs
Type: nucleic acid (DNA)
Strandedness: single
Molecular Type: synthetic DNA
Sequence Description:

GATGATTGAG GGGTACAGGG ATCTAAAATC TAGGTACACT ATGTTC 46
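
The listing does not say how the synthetic oligonucleotides relate to one another, but a quick check suggests that SEQ ID Nos. 11 and 14, and SEQ ID Nos. 13 and 16, are exact reverse complements. The sketch below is an illustration only: the sequences are copied from the listing with block spacing removed, and the names `PRIMERS`, `STATED_LENGTHS` and `reverse_complement` are hypothetical helpers, not terms from the patent.

```python
# Synthetic oligonucleotides as printed in the listing (spaces removed).
PRIMERS = {
    11: "GCCATCAAGATCCTGGCAAACAGCTACTACGGTTACTACGGC",
    12: "GATGGATCCAACTTCTCAACGTTGAACCCTCG",
    13: "GAACATAGTGTACCTAGATTTTAGATCCCTGTACCCCTCAATCATC",
    14: "GCCGTAGTAACCGTAGTAGCTGTTTGCCAGGATCTTGATGGC",
    15: "ATCGATATCCTCGACACTGACTACATAACCGAG",
    16: "GATGATTGAGGGGTACAGGGATCTAAAATCTAGGTACACTATGTTC",
}

# Lengths stated in the "Length:" field of each entry.
STATED_LENGTHS = {11: 42, 12: 32, 13: 46, 14: 42, 15: 33, 16: 46}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A<->T, C<->G)."""
    return seq.translate(COMPLEMENT)[::-1]


# Check that each printed sequence has the length stated in its header.
for seq_id, seq in PRIMERS.items():
    print(f"SEQ ID No. {seq_id}: {len(seq)} bases, stated {STATED_LENGTHS[seq_id]}")

# Check which pairs are reverse complements of each other.
for a, b in [(11, 14), (13, 16)]:
    is_pair = reverse_complement(PRIMERS[a]) == PRIMERS[b]
    print(f"SEQ ID No. {a} vs {b}: reverse complements = {is_pair}")
```

Complementary primer pairs of this kind are typical of overlap-extension mutagenesis, though that reading is an inference and is not stated in this part of the document.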



Claims 

1. A DNA comprising a nucleotide sequence that encodes a thermostable DNA polymerase having a DNA extension
rate of at least 30 bases/second and a 3'-5' exonuclease activity derived from a hyperthermophilic archaeon strain.

2. The DNA of claim 1, wherein the strain is KOD1.

3. The DNA of claim 1 or 2, wherein the nucleotide sequence encodes the thermostable DNA polymerase which has
the amino acid sequence of SEQ ID No. 1.

4. The DNA of any one of claims 1 to 3, wherein said DNA has a nucleotide sequence of SEQ ID No. 5.

5. A DNA encoding a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 
3'-5' exonuclease activity, said DNA comprising a nucleotide sequence: 

(a) which differs from a DNA of claim 4 in codon sequence due to the degeneracy of the genetic code; 

(b) which hybridizes with a DNA of claim 4 or section (a), above; or 

(c) which represents a fragment, allelic or other variation of a DNA of claim 4.

6. A recombinant DNA expression vector that comprises the DNA sequence of any one of claims 1 to 5. 

7. The recombinant DNA expression vector of claim 6, in which the vector is a vector derived from pET-8c.

8. A recombinant host cell which is transformed by a recombinant DNA expression vector of claim 7.

9. The recombinant host cell of claim 8, in which the host cell is Escherichia coli.

10. A thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' exonuclease
activity which is encoded by the DNA of any one of claims 1 to 5.

11. The thermostable DNA polymerase of claim 10, which has the following physical and chemical properties:

Action: It catalyses the extension reaction of a nucleotide sequence that is complementary to a template nucleotide
sequence using nucleotide triphosphates as substrate, and it has a 3'-5' exonuclease activity.

DNA extension rate: at least 30 bases/second

Optimum pH: 6.5 - 7.5 (at 75°C)

Optimum temperature: 75°C

Molecular weight: about 88 - 90 kDa

Amino acid sequence: as defined in SEQ ID No. 1.

12. A method for producing a DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5'
exonuclease activity comprising culturing the recombinant host cell of claim 8 or 9.

13. A method for purifying a DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5'
exonuclease activity comprising culturing the recombinant host cell of claim 8 or 9, and further (a) recovering the
cultured recombinant host cells, disintegrating them and preparing the cell extract, and (b) removing the impurified
proteins derived from recombinant host cells.

14. The DNA polymerase of claim 10 or 11, which is obtainable by using the recombinant host cells of claim 8 or 9 or
by the method of claim 12 or 13.

15. A method for amplifying a target nucleic acid comprising reacting the target nucleic acid with four kinds of dNTP
and primer sufficiently complementary to said target nucleic acid in a buffer solution which contains a thermostable
DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' exonuclease activity such
that the above mentioned primer is annealed to the target nucleic acid and an extension product is synthesized from
the primer.

16. A method for amplifying a target nucleic acid in a sample wherein each target nucleic acid consists of two separate 
complementary strands which comprises the following steps (a) to (d), characterized in that a thermostable DNA 
polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' exonuclease activity is used as a 
thermostable DNA polymerase: 

(a) modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids; 

(b) reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are
selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith
in a buffer solution which contains said thermostable DNA polymerase such that the above mentioned primers
are annealed to the single-stranded nucleic acids and extension products are synthesized from the primers;

(c) separating the primer extension products from the templates on which they are synthesized to produce
single-stranded nucleic acids; and

(d) repeatedly conducting the above mentioned steps (b) and (c). 

17. A method for detecting a target nucleic acid in a sample wherein each target nucleic acid consists of two separate 
complementary strands which comprises the following steps (a) to (e), characterized in that a thermostable DNA 
polymerase having a DNA extension rate of at least 30 bases/second and a 3'-5' exonuclease activity is used as a 
thermostable DNA polymerase: 

(a) modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids;

(b) reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are
selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith
in a buffer solution which contains said thermostable DNA polymerase such that the above mentioned primers
are annealed to the single-stranded nucleic acids and extension products are synthesized from the primers;

(c) separating the primer extension products from the templates on which they are synthesized to produce
single-stranded nucleic acids;

(d) repeatedly conducting the above mentioned steps (b) and (c); and

(e) detecting an amplified nucleic acid.

18. The method of any one of claims 15 to 17, wherein said thermostable DNA polymerase is encoded by the DNA of
any one of claims 1 to 5 or produced by the method of claim 12 or 13.

19. A reagent kit for amplifying target nucleic acid which comprises primers, wherein said primers are selected so as
to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, four kinds of dNTP,
divalent cation and the thermostable DNA polymerase encoded by the DNA of any one of claims 1 to 5 or produced
by the method of claim 12 or 13.

20. A reagent kit for detecting target nucleic acid which comprises primers, wherein said primers are selected so as to
be sufficiently complementary to different strands of target nucleic acid to anneal therewith, four kinds of dNTP,
divalent cation, thermostable DNA polymerase encoded by the DNA of any one of claims 1 to 5 or produced by the
method of claim 12 or 13, amplifying buffer solution, a probe capable of hybridizing with amplified nucleic acid and
a detection buffer solution.
Fig. 1

Measurement of the DNA Extension Rate of DNA Polymerase
Derived from the Hyperthermophilic Archaeon Strain KOD1

Fig. 2

Comparison of DNA Extension Rates of Various Thermostable
DNA Polymerases

Fig. 3

Comparison of the PCR due to the Difference in the Reaction Time
of Various Thermostable DNA Polymerases

M 1 2 3 4 5 6 7 8 9 10 11 12 M

1: KOD Polymerase (Derived from Hyperthermophilic archaeon strain KOD1) Reaction Time: 1 sec.

2: KOD Polymerase (Derived from Hyperthermophilic archaeon strain KOD1) Reaction Time: 5 sec.

3: KOD Polymerase (Derived from Hyperthermophilic archaeon strain KOD1) Reaction Time: 10 sec.

4: Taq Polymerase (Derived from Thermus aquaticus) Reaction Time: 1 sec.

5: Taq Polymerase (Derived from Thermus aquaticus) Reaction Time: 5 sec.

6: Taq Polymerase (Derived from Thermus aquaticus) Reaction Time: 10 sec.

7: Taq Polymerase (Derived from Thermus aquaticus) Reaction Time: 30 sec.

8: Taq Polymerase (Derived from Thermus aquaticus) Reaction Time: 60 sec.

9: Pfu Polymerase (Derived from Pyrococcus furiosus) Reaction Time: 30 sec.

10: Pfu Polymerase (Derived from Pyrococcus furiosus) Reaction Time: 60 sec.

11: Pfu Polymerase (Derived from Pyrococcus furiosus) Reaction Time: 90 sec.

12: Pfu Polymerase (Derived from Pyrococcus furiosus) Reaction Time: 120 sec.
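
To put the reaction times in the Fig. 3 legend in perspective, the following sketch relates an extension rate to the length of product that could be synthesized in a given time. It is an arithmetic illustration only: it takes the 30 bases/second lower bound from the claims as its default rate, treats extension as linear in time, and ignores annealing and every other kinetic factor. The function names are hypothetical.

```python
# Lower bound on the extension rate stated in the claims, in bases per second.
CLAIMED_MIN_RATE = 30.0


def bases_extended(seconds: float, rate: float = CLAIMED_MIN_RATE) -> float:
    """Bases synthesized in `seconds` at a constant `rate` (bases/second)."""
    return rate * seconds


def min_extension_time(length_bases: float, rate: float = CLAIMED_MIN_RATE) -> float:
    """Seconds needed to extend `length_bases` at a constant `rate`."""
    return length_bases / rate


# Reaction times taken from lanes 1 to 3 of the Fig. 3 legend.
for t in (1, 5, 10):
    print(f"{t:>3} s at {CLAIMED_MIN_RATE:.0f} bases/s -> {bases_extended(t):.0f} bases")

# Example: a 324-base product, the length of SEQ ID No. 9.
print(f"324 bases need about {min_extension_time(324):.1f} s")
```

Because the claimed figure is a minimum, these values are conservative estimates rather than measured results.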
Fig. 4

Construction of Expression Recombinant Vector (pET-pol)

Fig. 5

1: pET-8c Precipitate

2: pET-pol (ΔIVS-A, ΔIVS-B) Precipitate

3: pET-8c Supernatant Liquid

4: pET-8c Supernatant Liquid x 5

5: pET-pol (ΔIVS-A, ΔIVS-B) Supernatant Liquid

6: pET-pol (ΔIVS-A, ΔIVS-B) Supernatant Liquid x 5

Fig. 6

1: Vent Polymerase (Derived from Thermococcus litoralis)

2: pET-pol (ΔIVS-A, ΔIVS-B) Supernatant Liquid

3: pET-pol (ΔIVS-A, ΔIVS-B) Supernatant Liquid x 5

4: pET-8c Supernatant Liquid

5: pET-8c Supernatant Liquid x 5
Fig. 7

DNA Polymerase Gene of Hyperthermophilic Archaeon Strain KOD1
DNA Polymerase Gene of Thermococcus litoralis (Vent DNA Polymerase)